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1. Introduction 

The so called Skorokhod embedding problem or Skorokhod stopping problem was 
first formulated and solved by Skorokhod in 1961 (114j (English translation 
in 1965 For a given centered probability measure /i with finite second 

moment and a Brownian motion B, one looks for an integrable stopping time 
T such that the distribution of Bt is /x. This original formulation has been 
changed, generalized or narrowed a great number of times. The problem has 
stimulated research in probability theory for over 40 years now. A full account 
of it is hard to imagine; still, we believe there is a point in trying to gather 
various solutions and applications in one place, to discuss and compare them 
and to see where else we can go with the problem. This was the basic goal set 
for this survey and, no matter how far from it we have ended up, we hope that 
it will prove of some help to other researchers in the domain. 

Let us start with some history of the problem. Skorokhod's solution required 
a randomization external to B. Three years later another solution was proposed 
by Dubins |3T] (see Section 1?^ . which did not require any external randomiza- 
tion. Around the same time, a third solution was proposed by Root. It was part 
of his Ph.D. thesis and was then published in an article |103| . This solution has 
some special minimal properties, and will be discussed in Section [3 

Soon after, another doctoral dissertation was written on the subject by Mon- 
roe who developed a new approach using additive functionals (see Section 15^ . 
His results were published in 1972 \7^. Although he did not have any explicit 
formulae for the stopping times, his ideas proved fruitful as can be seen from 
the elegant solutions by Vallois (1983) (see Section l3.12|l and Bertoin and Le 
Jan (1992) (see Section n^.l5|l . which also use additive functionals. 

The next landmark was set in 1971 with the work of Rost He gener- 

alized the problem by looking at any Markov process and not just Brownian 
motion. He gave an elegant necessary and sufficient condition for the existence 
of a solution to the embedding problem (see Section . Rost made extensive 
use of potential theory. This approach was also used a few years later by Cha- 
con and Walsh ^^1, who proposed a new solution to the original problem which 
included Dubins' solution as a special case. 

By that time, Skorokhod embedding, also called the Skorokhod represen- 
tation, had been successfully used to prove various invariance principles for 
random walks (Sawyer (llOj V It was the basic tool used to realize a discrete 
process as a stopped Brownian motion. This ended however with the Komlos, 
Major and Tusnady [23 construction, which proved far better for this purpose 
(see Section ril.2f) . Still, the Skorokhod embedding problem continued to inspire 
researchers and found numerous new applications (see Sections 1101 and [TT|l . 

The next development of the theory came in 1979 with a solution proposed 
by Azema and Yor [3'. Unlike Rost, they made use of martingale theory, rather 
than Markov and potential theory, and their solution was formulated for any 
recurrent, real-valued diffusion. We will see in Section [S] that their solution can 
be obtained as a limit case of Chacon and Walsh's solution. Azema and Yor's 
solution has interesting properties which are discussed as well. In particular the 
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solution maximizes stochastically the law of the supremum up to the stopping 
time. This direction was continued by Perkins |90|. who proposed his own solu- 
tion in 1985, which in turn minimizes the law of the supremum and maximizes 
the law of the infimum. 

Finally, yet another solution was proposed in 1983 by Bass i4|. He used the 
stochastic calculus apparatus and defined the stopping time through a time- 
change procedure (see Section r3.11|) . This solution is also reported in a recent 
book by Stroock ([Tn|, p. 213-217). 

Further developments can be classified broadly into two categories: works 
trying to extend older results or develop new solutions, and works investigating 
properties of particular solutions. The former category is well represented by 
papers following Monroe's approach: solution with local times by Vallois |118| 
and the paper by Bertoin and Le Jan |^, where they develop explicit formulae 
for a wide class of Markov processes. Azema and Yor's construction was taken 
as a starting point by Grandits and Falkner and then by Pedersen and 
Peskir [M] who worked with non-singular diffusions. Roynette, Vallois and Yor 
|lU8j used Rost criteria to investigate Brownian motion and its local time > 1(J8) . 
There were also some older works on n-dimensional Brownian motion (Falkner 
|36| ) and right processes (Falkner and Fitzsimmons |37|V 

The number of works in the second category is greater and we will not try to 
describe it now, but this will be done progressively throughout this survey. We 
want to mention, however, that the emphasis was placed on the one hand on the 
solution of Azema and Yor and its accurate description and, on the other hand, 
following Perkins' work, on the control of the maximum and the minimum of 
the stopped martingale (Kertz and Rosier [HI], Hobson [Hi], Cox and Hobson 

All of the solutions mentioned above, along with some others, are discussed 
in Section|21 some of them in detail, some very briefly. The exceptions are Azema 
and Yor's, Perkins' and Root's solutions, which are discussed in S ect ions [Eland 
[3 respectively. Azema and Yor's solution is presented in detail with an emphasis 
on a one-dimensional potential theoretic approach, following Chacon and Walsh 
(cf. Section r3.8|) . This perspective is introduced, for measures with finite support, 
in Section 2) where Azema and Yor's embeddings for random walks are also 
considered. Perkins' solution is compared with Jacka's solution and both are 
also discussed using the one-dimensional potential theoretic approach. Section 
[3 devoted to Root's solution, contains a detailed description of works of Root 
and Loynes along with results of Rost. The two sections that follow. Sections [S] 
and [21 focus on certain extensions of the classical case. The former treats the 
case of non-centered measures and uniform integrability questions and the latter 
extends the problem and its solutions to the case of real-valued diffusions. The 
last two sections are concerned with applications: Section EH develops a link 
between the Skorokhod embedding problem and optimal stopping theory, and 
the last section shows how the embedding extends to processes; it also discusses 
some other applications. 

To end this introduction, we need to make one remark about the terminol- 



J. Obioj/Skorokhod embedding problem 



326 



ogy. Skorokhod's name is associated with several "problems" . In this work, we 
are concerned solely with the Skorokhod embedding problem, which we call also 
"Skorokhod's stopping problem" or simply "Skorokhod's problem". As men- 
tioned above, this problem was also once called the "Skorokhod representation" 
(see for example Sawyer Freedman P- 68) but this name was later 

abandoned. 

2. The problem and the methodology 

As noted in the Introduction, the Skorokhod embedding problem has been gen- 
eralized in various manners. Still, even if we stick to the original formulation, 
it has seen many different solutions, developed using almost all the major tech- 
niques which have flourished in the theory of stochastic processes during the 
last forty years. It is this methodological richness which we want to stress right 
from the beginning. It has at least two sources. Firstly, different applications 
have motivated, in the most natural way, the development of solutions formu- 
lated within different frameworks and proved in different manners. Secondly, 
the same solutions, such as the Azema-Yor solution, were understood better as 
they were studied using various methodologies. 

Choosing the most important members from a family, let alone ordering 
them, is always a risky procedure! Yet, we believe it has some interest and 
that it is worthwhile to try. We venture to say that the three most important 
theoretical tools in the history of Skorokhod embedding have been: a) potential 
theory, b) the theory of excursions and local times, and c) martingale theory. 
We will briefly try to argue for this statement, by pointing to later sections 
where particular embeddings which use these tools are discussed. 

As we describe in Section lT^ below. potential theory has two aspects for us: 
the simple, one-dimensional potential aspect and the more sophisticated aspect 
of general potential theory for stochastic processes. The former starts from the 
study of the potential function U^{x) — — J \x — y\d^{y), which provides an 
convenient description of (centered) probability measures on the real line. It 
was used first by Chacon and Walsh (see Section and proved to be a simple 
yet very powerful tool in the context of the Skorokhod problem. It lies at the 
heart of the studies performed by Perkins (see Section IB)) and more recently by 
Hobson [SI] and Cox and Hobson (see Section lB.20|l . We show also how it yields 
the Azema-Yor solution (see Section |OJ- 

General potential theory was used by Rost to establish his results on the 
existence of a solution to the Skorokhod embedding problem in a very general 
setup. Actually, Rost's results are better known as results in potential theory 
itself, under the name "continuous time filling-scheme" (see Section IXSjl . They 
allowed Rost not only to formulate his own solution but also to understand 
better the solution developed by Root and its optimal properties (see Section 
Cj). In a sense. Root's solution rehes on potential theory, even if he doesn't use 
it explicitly in his work. Recently, Rost's results were used by Roynette, Vallois 
and Yor jl08| to develop an embedding for Brownian motion and its local time 
at zero (see Section lH.18|l . 
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A very fruitful idea in the history of solving the embedding problem is the 
following: compare the realization of a Brownian trajectory with the realization 
of a (function of) some well-controlled increasing process. Use the latter to decide 
when to stop the former. We have two such "well-controlled" increasing processes 
at hand: the one-sided supremum of Brownian motion and the local time (at 
zero). And it comes as no surprise that to realize this idea we need to rely either 
on martingale theory or on the theory of local times and excursions. The one- 
sided supremum of Brownian motion appears in Azema and Yor's solution and 
their original proof in used solely martingale-theory arguments (see Section 
I5.4|l . Local times were first applied to the Skorokhod embedding problem by 
Monroe (see Section and then used by Vallois (see Section 15. 12|) . Bertoin 
and Le Jan (see Section [3. 15(1 used additive functionals in a very general setup. 
Excursion theory was used by Rogers jl01| to give another proof of the Azema- 
Yor solution. It is also the main tool in a recent paper by Obloj and Yor |84j . 

We speak about a solution to the Skorokhod embedding problem when the 
embedding recipe works for any centered measure /i. Yet, if we look at a given 
description of a solution, it was almost always either developed to work with 
discrete measures and then generalized, or on the contrary, it was designed to 
work with diffuse measures and only later generalized to all measures. This 
binary classification of methods happens to coincide nicely with a tool-based 
classification. And so it seems to us that the solutions which start with atomic 
measures rely on analytic tools which, more or less explicitly, involve potential 
theory and, vice versa, these tools are well-suited to approximate any measure 
by atomic measures. In comparison, the solutions which rely on martingale 
theory or the theory of excursions and local times, are in most cases easily 
described and obtained for diffuse measures. A very good illustration for our 
thesis is the Azema- Yor solution, as it has been proven using all of the above- 
mentioned tools. And so, when discussing it using martingale arguments as in 
Section lOl or arguments based on excursion theory (as in Rogers [lOlp . it is 
most natural to concentrate on diffuse measures, while when relying on one- 
dimensional potential theory as in Section 15.11 we approximate any measure 
with a sequence of atomic measures. 

2. 1 . Preliminaries 

We gather in this small subsection some crucial results which will be used 
throughout this survey without further explanation. B = [Bt : i > 0) always 
denotes a one-dimensional Brownian motion, defined on a probability space 
(Jl, JT, P) and Tt = c(i?s : s < t) \s its natural filtration, taken completed. 
The one-sided supremum process is St = sup„<j Bu and the two-sided supre- 
mum process is B^ — sup„<j Let T denote an arbitrary stopping time, 
Tx = inf{t > : Bt ~ x} the first hitting time of x, and Ta,b — ini{t > : Bt ^ 
(a, &)}, a < < 6, the first exit time form (a, b). M — {Mt : i > 0) will always 
be a real- valued, continuous local martingale and {{M)t t > Q) its quadratic 
variation process. When no confusion is possible, its one-sided and two-sided 
suprema, and first exit times will be also denoted St, and Ta.b respectively. 
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For a random variable X, C{X) denotes its distribution and X ^ /i, or 
C{X) = fi, means "X has the distribution /i." The support of a measure ^ 

is denoted supp{ii). For two random variables X and Y, X ^ Y and X — Y 
signify both that X and Y have the same distribution. The Dirac point mass 
at a is denoted da, or S^a} when the former might cause confusion. The Normal 
distribution with mean m and variance is denoted Af{m, cr^). Finally, 
signifies weak convergence of probability measures. 

Note that as [Bt/^T^ b)t>^ bounded, it is a uniformly integrable martingale 
and the optional stopping theorem (see Revuz and Yor 99 , pp. 70-72) yields 
EBt„ = 0. This readily implies that P(Bt„ = a) = = 1 - P(St„ , = h). 
This generalizes naturally to situations when Brownian motion does not start 
from zero. This property is shared by all continuous martingales and, although 
it looks very simple, it is fundamental for numerous works discussed in this 
survey, as will be seen later (cf. Section . 

The process — t \s easily seen to be a martingale. By optional stopping 
theorem we see that if ET < oo then ET = Ei3|,. We can actually prove a 
stronger result (see Root |1U3) and Sawyer |110| ): 

Proposition 2.1. Let T he a stopping time such that {BtAT)t>o is a uniformly 
integrable martingale. Then there exist universal constants Cp,Cp such that 



CpE 



j.p/2 



< E 



< CpE 



rpp/2 



forp>l. (2.1) 



Proof. The Burkholder-Davis-Gundy inequalities (see Revuz and Yor j^Hl P- 
160) guarantee existence of universal constants kp and Kp such that kpETP^'^ < 
E[{B^)P] < KpETP/'^, for any p > 0. As (Bt^T : i > 0) is a uniformly inte- 
grable martingale we have sup^ E \BtAT\^ = E |-Bt|'°, and Doob's inequalities 
yield E|Bt|^ < E[(S^)p] < (^)^E|St|p for any p > 1. The proof is thus 

completed taking Cp = f^p(^;~[) and Cp = Kp. □ 



We note that the above Proposition is not true for p = I. Indeed we can 
build a stopping time T such that {Bt^T '. t > 0) is a uniformly integrable 
martingale, E \Bt\ < oo, and eVt = oo (see Exercise II. 3. 15 in Revuz and Yor 

The last tool that we need to recall here is the Dambis-Dubins-Schwarz 
theorem (see Revuz and Yor [5^1, P- 181). It allows us to see a continuous local 
martingale (Mj : t > 0) as a time-changed Brownian motion. Moreover, the 
time-change is given explicitly by the quadratic variation process {{M)t : t > 0). 
This way numerous solutions of the Skorokhod embedding problem for Brownian 
motion can be transferred to the setup of continuous local martingales. 



2.2. On potential theory 

Potential theory has played a crucial role in a number of works about the Sko- 
rokhod embedding problem and we will also make a substantial use of it, es- 
pecially in its one-dimensional context. We introduce here some potential the- 
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oretic objects which will be important for us. As this is not an introduction to 
Markovian theory we will omit certain details and assumptions. We refer to the 
classical work of Blunienthal and Getoor |S] for precise statements. 

On a locally compact space {E, £) with a denumerable basis and a Borel a- 
field, consider a Markov process X = {Xt : t>Q) associated with [P^ : t > 0), 
a standard semigroup of submarkovian kernels. A natural interpretation is that 
vP^ represents the law of Xt, under the starting distribution Xq ^ v. Define 
the potential kernel through = P^dt. This can be seen as a linear 
operator on the space of measures on E. The intuitive meaning is that vU^ 
represents the occupation measure for X along its trajectories, where Xq ~ u. 

If the potential operator is finite^ it is not hard to see that for two bounded 
stopping times, S < T, we have U^{Pg — P^) > 0. This explains how the 
potential can be used to keep track of the relative stage of the development 
of the process (see Chacon [T7^). We will continue the discussion of general 
potential theory in Section with the works of Rost. 

For X = B, a real-valued Brownian motion, as the process is recurrent, the 
potential vU^ is infinite for positive measures v. However, if the measure is a 
signed measure with z/(R) = 0, and / |a;|;/((ia;) < oo, then the potential vU^ is 
not only finite but also absolutely continuous with respect to Lebesgue measure. 
Its Radon-Nikodym derivative is given by 

^^^{x)^- J\x-yHdy). (2.2) 

The RHS is well defined for any probability measure /i on M (instead of ly) with 
/ \x\d^{x) < oo and, with a certain abuse of terminology, this quantity is called 
the one-dimensional potential of the measure /i: 

Definition 2.2. Denote by the set of probability measures on M with fi- 
nite first moment, jj, G Ai iff J \x\dfi{x) < oo. Let denote the subset 
of measures with expectation equal to m, fj, G iff J \x\d^{x) < oo and 
J xdfi{x) = m. Naturally = UmeR-'^m- -^^^ one- dimensional potential op- 
erator U acting from A4 ^ into the space of continuous, non-positive functions, 

U : C(R, is defined through U n{x) := (x) — — \x — y\fi{dy). 

We refer to U fi as to the potential of /i. 

We adopt the notation U n to differentiate this case from the general case 
of a potential kernel when /iC/^ is a measure and not a function. This 
simple operator enjoys some remarkable properties, which will be crucial for the 
Chacon- Walsh methodology (see Section which in turn is our main tool in 
this survey. The following can be found in Chacon ^7], and Chacon and Walsh 

Proposition 2.3. Let to S R and fi € Mi,. Then 

(i) U/J, is concave and Lipschitz- continuous with parameter 1; 
^That is, for any finite starting measure, it provides a finite measure. 
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(ii) Ufj,{x) < (7(5{„ij.(a;) — ~\x — m\ and if v E and Uv < f//i then 

(in) for Ail, ^2 e M}n, ^ira^^i^^ \UfJ,i{x) - U^l2{x)\ = 0; 

(iv) for /i„ G -A^m, IJ-n ^ IJ- if and only if [//i„(a;) > U^{x) for all x e M; 



('wj /or v e AIq, «/ /]g x^z/((ia;) < oo, then j^x'^v(dx) = /jj + Uv{x) 
(vi) for V e Mln, [/i^|[6,oo) = C^/^l[b,oo) j/a?irf onZy i/Ai|(6,oo) = f^l(b,oo); 



(^Diij Zet Bq ^ V, and define p through p ^ Bt^^, then ?7p|(-oo.a]u[6.oo) = 
^'^l(-oo,a]u[b,oo) and Up is linear on [a, 6]. 

Proof. We only prove (ii), (vi) and (vii). Let p € ■^m- The first assertion in 
(ii) follows from Jensen's inequality as 



Up{x) = - \x-y\dp{y)<- / {x - y)dp{y) ^U6^{x) 
To prove the other assertions we rewrite the potential: 



Up{x) = - \x - y\dp{y) ^ - [x - y)dp{y) - {y - x)dp{y) 

= x(2pi[x,(^))-l) +Tn-2 [ ydpiy), (2.3) 

^ ^ J[x,oc) 

where, to obtain the third equality, we use the fact that p G Ai}^ and so 
I[x oo) y^l^iy) = — x) y^f-iy)- ^^^^ second assertion in (ii) now follows. 
For I' e with Uv < Up we have Uv{x) < —\x — m\. Using the expression 
H2.3|l above and letting x — oo we see that xdv{x) > m and, likewise, 
letting a; ^ oo we see that the reverse holds. 

The formula displayed in l|2.3|l shows that the potential is linear on intervals 
[a, b] such that p{{a, b)) = 0. Furthermore, it shows that for i/ E Ai}n, C^i^|[6,oo) = 
^A*l[fc,oo) if and only if p\[b,oo) = ^\{b,oo)- Note that the same is true with [b, oo) 
replaced by (— oo, a] and that in particular Uiy = Up if and only ii v = p. The 
assertions in (vii) follow from the fact that if ^ v, then the law of Bt^^ 
coincides with ly on (— oo, a) U (6, oo) and does not charge the interval (a, b). □ 

2.3. The Problem 

Let us state formally, in its classical form, the main problem considered in this 
survey. 

Problem (The Skorokhod embedding problem). For a given probability 
measure p on R, such that J^\x\dp{x) < oo and Jj^xdp{x) = 0, find a stop- 
ping time T such that Bt ^ p and (Bt^T '■ t > 0) is a uniformly integrable 
martingale. 
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Actually, in the original work of Skorokhod |115| . the variance v = x'^dix{x), 
was assumed to be finite, w < oo, and the condition of uniform integrability of 
{BtAT ■ t > 0) was replaced by ET v. Naturally the latter implies that 
: t > 0) is a uniformly integrable martingale. However the condition 
f < oo is somewhat artificial and the condition of uniform integrability of 
{BtAT ■ t > 0) seems the correct way to express the idea that T should be 
"small" (see also Section (Sj). We point out that without any such condition 
on T, there is a trivial solution to the above problem. We believe it was first 
observed by Doob. For any probability measure ^, we define the distribution 
function F^{x) = cx), a;]) and take its right-continuous inverse. The 
distribution function of a standard Normal variable is denoted The stopping 
time T = mi{t > 2 : Bt — F~^{^{Bi))} then embeds fi in Brownian motion, 
Bt ^ fJ-- However, we always have ET = oo. 

Skorokhod first formulated and solved the problem in order to be able to 
embed random walks into Brownian motion (see Section [ll.l() . We mention that 
a similar question was posed earlier by Harris |52) . He described an embedding 
of a simple random walk on Z (inhomogeneous in space) into Brownian motion 
by means of first exit times. Harris used the embedding to obtain some explicit 
expressions for mean recurrence and first-passage times for the random walk. 

3. A guided tour through the different solutions 

In Section 121 above, we tried to give some classification of known solutions to the 
Skorokhod embedding problem based mainly on their methodology. We intended 
to describe the field and give the reader some basic intuition. We admit that 
our discussion left out some solutions, as for example the one presented by Bass 
(see Sectionl^Hj, which did not fit into our line of reasoning. We will find them 
now, as we turn to the chronological order. We give a brief description of each 
solution, its properties and relevant references. The references are summarized 
in Tabled Stopping times, solutions obtained by various authors, are denoted 
by T. with subscript representing author's last name, i.e. Skorokhod's solution 
is denoted by Ts- 

3.1. Skorokhod (1961) 

In his book from 1961 Skorokhod jll4j (English translation in 1965 jll5j ) defined 
the problem under consideration and proposed the first solution. He introduced 
it to obtain some invariance principles for random walks. Loosely speaking, 
the idea behind his solution is to observe that any centered measure with two 
atoms can be embedded by the means of first exit time. One then describes 
the measure as a mixture of centered measures with at most two atoms and 
the stopping rule consists in choosing independently one of these measures, 
according to the mixture, and then using the appropriate first exit time. Actually 
Skorokhod worked with measures with density. His ideas were then extended 
to work with arbitrary centered probability measures by various authors. We 
mention Strassen |116j . Breiman ^I] (see Section and Sawyer |109| . We 
present here a rigorous solution found in Frecdman's textbook (@21j PP- 68-76) 
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and in Perkins [501 ■ Freedman used the name "Skorokhod representation" rather 
than "Skorokhod embedding problem" aUuding to the motivation of representing 
a random walk as a Brownian motion stopped with a family of stopping times. 
He used this representation to prove Donsker's theorem and Strassen's law (see 
Section [11.2(1 . He also obtained some technical lemmas, which he exploited in 
proofs of invariance principles for Markov chains (^HIj PP- 83-87). 

For Brownian motion (Bf) and a centered probability measure fi on R, define 
for any A > 



Let R be an independent random variable with the following distribution func- 
tion: 



where S stands for "Skorokhod" , satisfies Bts ~ ^-nd (BjaTs : i > 0) is a uni- 
formly integrable martingale. In his original work Skorokhod assumed moreover 
that the second moment of /i was finite but this is not necessary. 

Sawyer |109j investigated in detail the properties of constructed stopping 
times under exponential integrability conditions on the target distribution /j. In 
particular he showed that if for all x > 0, /^((— oo,x] U [x,oo)) < Cexp(— ax*^), 
for some positive C,a,e, then P(Ts > 2:) < Ci exp(— 6a;''), with b = a^~°, 
6 = €/{2 + e) and some positive constant Ci > 0. One has then E(exp(6T'|)) < 
C2 /g°° exp(a|a;|')d/i(a;), with C2 some positive constant depending only on e. 

3.2. Dubins (1968) 

Three years after the translation of Skorokhod's original work, Dubins pre- 
sented a different approach which did not require any additional random variable 
independent of the original process itself. The idea of Dubins is crucial as it will 
be met in various solutions later on. It is clearly described in Meyer |76j . Given 
a centered probability measure /i, we construct a family of probability measures 
Hn with finite supports, converging to /i. We start with /iq = 5^, the measure 
concentrated on the expectation of (i.e. 0). We look then at /i_ and ^_|_ - 
the measures resulting from restricting (and renormalizing) /i to two intervals, 
(—00,0] and (0,-1-00). We take their expectations: m_|_ and to_. Choosing the 
weights in a natural way we define a centered probability measure /^i (concen- 
trated on 2 points and m_). The three points considered so far (r7i_, 0, m^) 
cut the real line into 4 intervals and we again look at renormalized restrictions 
of jjL on these 4 intervals. We take their expectations, choose appropriate weights 
and obtain a centered probability measure /i2- We continue this construction: 
in the n*'* step we pass from a measure /i„ concentrated on 2"~^ points and a 



p{\) = inf < 2/ e M : / l{x<y}iJ{x>\}xd^i{x) < 





Then, the stopping time defined by 



Ts = mf{t : Bt i {-p{R),R)}, 
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total of 2" — 1 points cutting the real line, to the measure fin+i concentrated 
on 2" new points (and we have therefore a total of 2"+^ — 1 points cutting the 
real line). Not only does /x„ converge to /i but the construction embeds easily 
in Brownian motion using the successive hitting times (T'^)i</c<2"-i of pairs of 
new points (nearest neighbors in the support of fin+i of a point in the support 
of T is the limit of T!^" and it is straightforward to see that it solves the 
Skorokhod problem. 

Bretagnolle '12 has investigated this construction and in particular the ex- 
ponential moments of Dubins' stopping time. Dubins' solution is also reported 
in Billingsley's textbook However his description seems overly complicated^. 

3.3. Root (1968) 

The third solution to the problem came from Root |103| . We will devote Section[7| 
to discuss it. This solution has a remarkable property - it minimizes the variance 
(or equivalently the second moment) of the stopping time, as shown by Rost 
|l()7j . Unfortunately, the solution is not explicit and is actually hard to describe 
even for simple target distributions. 

3.4. Hall (1968) 

Hall described his solution to the original Skorokhod problem in a technical 
report at Stanford University [49 , which seems to have remained otherwise 
unpublished. He did however publish an article on applications of the Skorokhod 
embedding to sequential analysis |50| . where he also solved the problem for 
Brownian motion with drift. We stress this, as the problem was treated again 
over 30 years later by Falkner, Grandits, Pedersen and Peskir (cf. Section nH7|l 
and Hall's work is never cited. Hall's solution to the Skorokhod embedding for 
Brownian motion with a drift was an adaptation of his method for standard 
Brownian motion, in very much the same spirit as the just mentioned authors 
were adapting Azema and Yor's solution somewhat 30 years later. 

Hall looked at Skorokhod's and Dubins' solutions and saw that one could 
start as Dubins does but then proceed in a way similar to Skorokhod, only this 
time making the randomization internal. 

More precisely, first he presents an easy randomized embedding based on an 
independent, two-dimensional random variable. For /i, an integrable probability 
measure on R, / \x\d^{x) < 00, / xdii{x) = m, define a distribution on 
through d7?^(u,?;) = {v-u){ J^{x-m)dn{x)) ^lu<m<vd^i{u)d^.{v). Then, for 
a centered measure (m — 0), the stopping time T = mi{t > : Bt ^ {U,V)}, 
where ([/, V) ^ H independent of B, embeds /i in B, i.e. Bt ^ fJ-. This solution 
was first described in Breiman ^Jj. We remark that the idea is similar to the 
original Skorokhod's embedding and its extension by Freedman and Perkins 
(cf. Section l3.1|l . namely one represents a measure /i as a random mixture of 

^As a referee has pointed out, the justification of the convergence of the construction, on 
page 517, could be considerably simplified through an argument given in Exercise II. 7 in Neveu 
|8T). p. 37. 



J. Obioj/Skorokhod embedding problem 



334 



measure with two atoms. The difference is that in Skorokhod's construction V 
was a deterministic function of U. 

The second step consists in starting with an exit time, which will give enough 
randomness in B itself to make the above randomization internal. The crucial 
tool is the classical Levy result about the existence of a measurable trans- 
formation from a single random variable with a continuous distribution to a 
random vector with an arbitrary distribution. Define and fi^, as in Sec- 
tion 13.21 to be the renormalized restrictions of fi to M+ and ]R_ respectively, 
and denote their expectations m+ and rri- (i.e. m+ — ^(^^^q^^^ /(o oo) ^^'^(^)' 
= M~oo,o ] ) /(-oo,o] xdfj,{x)). Take Ti = Tm+Am^ to be the first exit time 
from [m_,m+]. It has a continuous distribution. There exist therefore measur- 
able transformations and /~ such that (J7%y) = /*(Ti) has distribution 
Hfi-, i E {+, — }. Working conditionally on Bti = nii, the new starting (shifted) 
Brownian motion Wg — -B^+Tu a-nd the stopping time Ti are independent. 
Therefore, the stopping time T2 — inf{s > : Ws ^ {U^, 1^*)} embeds in the 
new (shifted) Brownian motion W. Finally then, the stopping time Th = T1+T2 
embeds fi in B. 

As stated above, in jSU] this solution was adapted to serve in the case 
of B^^^ ^ Bt + 6t and Hall identified correctly the necessary and sufficient 
condition on the probability measure fi for the existence of an embedding, 
namely J e"^^^ ii{dx) < 1. His method is very similar to later works of Falkner 
and Grandits (cf. Section f3.17l and Section |5J). As a direct application, Hall 
also described the class of discrete submartingales that can be embedded in 
{B^^^ '■ t > 0) hy means of an increasing family of stopping times. 

3.5. Rost (1971) 

The first work |104| . written in German, was generalized in |105| . Rost's main 
work on the subject |l()5j . does not present a solution to the Skorokhod em- 
bedding problem in itself. It is actually best known for its potential theoretic 
results, known as the "filling scheme" in continuous time. It answers however 
a general question about the existence of a solution to the embedding problem 
for an arbitrary Markov process. The formulation in terms of a solution to the 
Skorokhod embedding problem is found in |1()6| . 

We will start with an attempt to present the general continuous-time filling 
scheme and the criterion on the laws of stopped processes it yields. We will 
then specialize to the case of transient processes. We will close this section with 
a remark on a link between Rost's work and some discrete time works (e.g. 
Pitman ^7\). We do not pretend to be exhaustive and sometimes we make only 
formal calculations, as our basic aim is just to give an intuitive understanding 
of otherwise difficult and involved results. 

Consider, as in Section l^^ a Markov process {Xt)t>a on [E, £), with poten- 
tial operator J7^. Let f, n be two positive measures on £, with finite variation. 
u is thought of as the starting measure of X, Xq ~ v, and /i is the measure for 
which we try to determine whether it can be the distribution of X stopped at 
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some stopping time"^. Rost shows that there exists a sequence of measures (vt), 
weakly right continuous in t, and a stopping time S (possible randomized) such 
that: 

• < i^t+, < i^tP^ < lyPtis^ yt,s>0; 

• vt is the law of Xtls>t- 
The family fit is defined through 

H - Ht + vt = lyPsM^ t>0. (3.1) 

We set /ioo =J, limf_>oo /^ti A = A* ^ /^oo- We say that ^ /i ("i/ is earlier than 
/x") if and only if /^oo = 0. This defines a partial order on positive measures with 
finite variation on £. 

Theorem 3.1 (Rost | 105 p. We have v < fi if and only if n = vP^ for some 
(possibly randomized) stopping time T . 

This gives a complete description of laws that can be embedded in X. How- 
ever this description, as presented here, may be hard to grasp and we specialize 
to the case of transient Markov processes to explain it. Luckily, one can pass 
from one case to another. Namely, given a Markov process X we can consider 
X", a transient Markov process, given by X killed at an independent time with 
exponential distribution (with parameter a). A result of Rost assures that if 
11%^ is the measure ^oo constructed as above but with X replaced by X", then 

Hao =i lima^oM^D- 

We suppose therefore that X is transient, and that ^C/^ and vU^ are a- 
finite. For an almost Borel set A C E the first hitting time of A, T4 inf{t > 
: Xt £ A}, is a stopping time. Note that the measure i^P-r^ is just the law of 
Xta- 

Recall, that the reduite of a measure p is the smallest excessive measure 
which majorizes p. It exists, is unique, and can be written as Pt^^ ^ certain 
A C E. We have the following characterization of the filling scheme, or balayage 
order (see Meyer |77|, Rost |TU^ '). 

The measure p, admits a decomposition p ~ JI + poo, where pooU-^ is the 
reduite of {p—v)U^ andJI is of the form vP^ , T a stopping time. Furthermore, 
there exists a finely closed set A which carries poo and for which poo = {p ^ 
v)P^^ or ijl— v)P^^ — 0. In the special case pU^ < vU^ we have p^o = and 
p = vP^ for some T . 

We can therefore reformulate Theorem lH.ll above: for some initial distribution 
Xq ~ V, there exists a stopping time T such that Xt ~ /it if and only if 

pU^ < vU^. (3.2) 

•^Note that in our notation the roles of /i and v are switched, compared to the articles of 

Rost irnsiirnTi . 
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We now comment on the above: it can be rephrased as "/i happens after i/" . It 
says that 



for any positive, Borel function f : E R+, where vJJ-^ {f) = 

and denotes the expectation under the initial measure Xq 
can be written as: 



- V. The RHS 



i.f/^(/) =E, 



f{Xt)dt 



f{Xt)dt 



f{Xt)dt +iJiU''{f). 



One sees that in order to be able to define T, condition H3.2|l is needed. On 

the other hand if it is satisfied, one can also hope that Ej^ f{Xt)dt = 

vU-^[f) — ^jLU^{f), defines a stopping time. Of course this is just an intuitive 
justification of a profound result and the proof makes a considerable use of 
potential theory. 

Rost developed the subject further and formulated it in the language of 
the Skorokhod embedding problem in 106,. We note that the stopping rules 
obtained in this way may be randomized as in the original work of Skorokhod. 

An important continuation of Host's work is found in two papers of Fitzsim- 
mons. In I3H1 (with some minor correction recorded in |39|) an existence theorem 
for embedding in a general Markov process is obtained as a by-product of inves- 
tigation of a natural extention of Hunt's balayage Lbiti (see j38) for appropriate 
definitions). The second paper is devoted to the Skorokhod problem for 
general right Markov processes. Suppose that Xq ~ ly and the potentials of ly 
and are cr- finite and iiU-^ < vU-^ . It is shown then that there exists then a 
monotone family of sets {C(r);0 < r < 1} such that if T is the first entrance 
time of C(i?), where R is independent of X and uniformly distributed over [0, 1], 
then Xt ^ /i. Moreover, in the case where both potentials are dominated by 
some multiple of an excessive measure m, then the sets C(r) are given explicitly 

in terms of the Radon-Nikodym derivatives ''^^^^ and '^'•^^ ^ . 

To close this section, we make a link between the discrete filling scheme 
and some other works for discrete Markov chains. Namely, we consider the 
occupation formula for Markov chains developed by Pitman ([HHIj EH)- Let 
{Xn : n > 0) be a homogenous Markov chain on some countable state space 
J, with transition probabilities given by P = P{i, Let T be a finite 

stopping time for X . Define the pre-T occupation measure jt through 7t(^) = 
^{^n e A,T > n) , A C J . Let X he the initial distribution of the process, 
Xq ^ A, and At the distribution of Xt- Then Pitman (97} . as an application of 
the occupation measure formula, obtains: 



(3.3) 



Given a distribution fi on J this can serve to determine whether there exists a 
stopping time T such that Xt — fJ.. 
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We point out that the above can be obtained via Rost's discrete filling 
scheme. Rost ( \W5\ p. 3) obtains 

/i - A + M = MP, where M = ^ A„ . (3.4) 

n 

Applying this to the process — (X„at : n > 0), we have M = jt- Rost's 
condition in Theorem 13. II namely jl — ^, is then equivalent to (|3.3|l . 

The occupation measure formula admits a continuous-time version (see Fitzsim- 
mons and Pitman 41 ), but we will not go into details nor into its relations with 
the continuous-time filling scheme of Rost. 

3.6. Monroe (1972) 

We examine this work only very briefly. Monroe |79j doesn't have any explicit 
solutions but again what is important is the novelty of his methodology. Let 
{L^)t>o be the local time at level a of Brownian motion. Then the support of 
dL^ is contained in the set {Bt = a} almost surely. We can use this fact and 
try to define the stopping time T through inequalities involving local times at 
levels a ranging through the support of the desired terminal distribution /z. The 
distribution of Bt will then have the same support as fi. Monroe UH] shows 
that we can carry out this idea and define T in such a way that Bt ~ A* ^^'^ 
{Btf\T)t>Q is a uniformly integrable martingale. We will see an explicit and more 
general construction using this approach by Bertoin and Le Jan in Section f3.15l 
To a lesser extent also works of Jculin and Yor [^01 and Vallois |118| can be seen 
as continuations of Monroe's ideas. 

3.7. Heath (1974) 

Heath jJS] considers the Skorokhod embedding problem for the n-dimensional 
Brownian motion. More precisely, he considers Brownian motion killed at the 
first exit time from the unit ball. The construction is an adaptation of a potential 
theoretic construction of Mokobodzki, and relies on results of Rost (see Section 
I3.5|l . The results generalize to processes satisfying the "right hypotheses". We 
note that the stopping times are randomized. 

3.8. Chacon and Walsh (1974-1976) 

Making use of potential theory on the real line, Chacon and Walsh J5] gave an 
elegant and simple description of a solution which happens to be quite general. 
Their work was based on an earlier paper of Chacon and Baxter , who worked 
with a more general setup and obtained results, for example, for n-dimensional 
Brownian motion. This approach proves very fruitful in one dimension and we 
will try to make the Chacon and Walsh solution a reference point throughout 
this survey. 

Recall from Definition 12 . 21 that . for a centered probability measure n on R, 
its potential is defined via Ufi{x) = — J^\x — y\dfj.{y). We will now use the 
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properties of this functional, given in Proposition 12.31 to describe the solution 
proposed by Chacon and Walsh. 

Write fiQ = Sq. Choose a point between the graphs of UfiQ and Ufi, and 
draw a line h through this point which stays above U /i (actually in the original 
construction tangent lines are considered, which is natural but not necessary). 
This line cuts the potential USo in two points ai < < 61. We consider the new 
potential U /zi given by U ^0 on (—00, ai] U [61, 00) and linear on [ai, 61]. We iter- 
ate the procedure. The choice of lines which we use to produce potentials 
is not important. It suffices to see that we can indeed choose lines in such a way 
that U jjL (and therefore ^ fJ-)- This is true, as Ufi is a concave func- 

tion and it can be represented as the infimum of a countable number of affine 
functions. The stopping time is obtained therefore through a limit procedure. 
If we write Ta^b for the exit time of [a, b] and for the standard shift operator, 
then Ti = Ta^M^ T2 = Ti + Ta^^b2 ° ^'ti, = T^-i + Ta^,b„ o 9t„_i and 

T = limT„. It is fairly easy to show that the limit is almost surely finite and for 
a measure with finite second moment, ET = J x^d^{x) (via (v) in Proposition 
12.311 . The solution is easily explained with a series of drawings: 
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1. The initial situation 2. The first step 
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3. After the first step 4. After the second step 



What is really being done on the level of Brownian motion? The procedure 
was summarized by Meilijson Given a centered measure [i express it as a 
limit of [In - measures with finite supports. Do it in such a way that there exists a 
martingale (X„)„>i, Xn ^ fJ-m which has almost surely dichotomous transitions 
(i.e. conditionally on the past with X„ = a, there are only two possible values 
for Xn+i). Then embed the martingale X„ in Brownian motion by successive 
first hitting times of one of the two points. 

Dubins' solution (see Section is a special case of this procedure. What 
Dubins proposes is actually a simple method of choosing the tangents. To obtain 
the potential Ufii draw tangent at (0, Ufj.{0)), which will cut the potential U60 
in two points a < < 5. Then draw tangents in (a, U ^{a)) and (&, U ^{b)). The 
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lines will cut the potential Ufii in four points yielding the potential C//Z2. Draw 
the tangents in those four points obtaining 8 intersections with U ^2 which give 
new coordinates for drawing tangents. Then, iterate. 

Chacon and Walsh solution remains true in a more general setup. It is very 
important to note that the only property of Brownian motion we have used 
here is that for any a < < 6, F(i?T„ t, = 1) = 532^- This is the property which 
makes the potentials of fin piece- wise linear, or more generally which makes 
the assertion (vii) of Proposition 12.31 true. However, this is true not only for 
Brownian motion but for any continuous martingale (Mt) with {M,M) 00 — 00 
a.s. The solution presented here is therefore valid in this more general situation. 

The methodology presented above, allows us to recover intuitively other 
solutions and we will make of it a reference point for us. In the case of a measure 
/i with finite support, this solution contains as a special case not only the solution 
of Dubins but also the solution of Azema and Yor (see Section 0J). The general 
Azema-Yor solution can also be explained via this methodology (cf. Section [5?T)l 
as well as the Vallois construction (cf. Section r3.12|l as proven by Cox |22| . 

3.9. Azema-Yor (1979) 

The solution developed by Azema and Yor has received a lot of attention in 
the literature and its properties were investigated in detail. In consequence we 
might not be able to present all of the relative material. We will first discuss 
the solution for measures with finite support in Sectional and then the general 
case in Section |S1 The general stopping time is given in H5.3|l and the Hardy- 
Littlewood function, on which it relies, is displayed in (|5.2|l . 

3.10. Falkner (1980) 

Falkner 36 discusses embeddings of measures into the n-dimensional Brownian 
motion. This was then continued for right processes by Falkner and Fitzsimmons 
P7] . Even though we do not intend to discuss this topic here we point out some 
difficulties which arise in the multi-dimensional case. They are mainly due to 
the fact that the Brownian motion does not visit points any more. If we consider 
measures concentrated on the unit circle it is quite easy to see that only one 
of them, namely the uniform distribution, can be embedded by means of an 
integrable stopping time. Distributions with atoms cannot be embedded at all. 

3.11. Bass (1983) 

The solution proposed by Bass is basically an application of Ito's formula and the 
Dambis-Dubins-Schwarz theorem. Bass uses also some older results of Yershov 
[nS|. This solution is also reported in Stroock's book f fTTT] . p. 213-217). Let F 
be the distribution function of a centered measure fi, $ the distribution function 
of 7V(0, 1) and pt{x) the density of 7V(0, t). Define function ^(a:;) = F~^($(a;)). 
Take a Brownian motion (i?t), then g{Bi) ^ fj,. Using Ito's formula (or Clark's 
formula) we can write 
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g{Bi) ^ J a{s,Bs)dBs, where a{s,y) ^ - J 



— g{z + y)dz. 



We put a{s,y) = 1 for s > 1 and define a local martingale M through Mt = 
a{s, Bs)dBs. Its quadratic variation is given by R{t) = J^a^{s,Bs)ds. De- 
note by S the inverse of R. Then the process Nu = -M5(„), u > 0, is a Brownian 
motion. Furthermore A^fl;(i) = Mi ^ ^. Bass j3] then uses a differential equa- 
tion argument (following works by Yershov {123| ) to show that actually -R(l) 
is a stopping time in the natural filtration of N. The argument allows to con- 
struct, for an arbitrary Brownian motion (/3t), a stopping time T such that 
(A^^(i), i?(l)) ~ {Pt,T), which is the desired solution to the embedding prob- 
lem. 

In general the martingale M is hard to describe. However in simple cases it 
is straightforward. We give two examples. First consider fi = ■^{5-i + Si). Then 

Mt = sgn{Bt)(l-2'i>(-^)), t<l. 



Write gi for "the last before time 1," that is: gi — sup{u < 1 : i?„ = 0}. This 
is not a stopping time. The associated Azema supermartingale (cf. Yor I124j p. 
41-43) is denoted Z^^ = F{gi > t\!Ft), where (jFt) is the natural filtration of 
(Bt). We can rewrite M in the following way 

Mt = sgn{Bt)il - 2Zf ) = E [sgn{Bt)il - 21g,yt)\J't 

Consider now fi ~ jSq + |(^-2 + S2) and write ^ — — Then 

Mt = sgniBt)2{^{::i±B^) - $(^^)) = ^ [l.,>e - 1b.<-,\^. 

What really happens in the construction of the martingale M should now be 
clearer: we transform the standard normal distribution of Bi into /Lt by a map 
R ^ M and hit this distribution by readjusting Brownian paths. 



3.12. Vallois (1983) 

In their work, Jeulin and Yor 60 studied stopping times T^^^ — mi{t > : 
B^h{Lt) + B^k{Lt) = 1}, where B and L are respectively Brownian motion 
and its local time at 0, and fc, h are two positive Borel functions which satisfy 
some additional conditions. They described the distributions of Xt and {Xt, T). 
This, using Levy's equivalence theorem which asserts that the two-dimensional 
processes {St, St — Bt) and (Lt, \Bt\) have the same law, can be seen as a gener- 
alization of results of Azema and Yor Vallois 118 follows the framework of 
Jeulin and Yor [HOI and develops a complete solution to the embedding problem 
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for Brownian motion. For any probability measure on R he shows there exist 
two functions and h~ and ^ > such that 



embeds /i in B, Bt^ ^ fi, where Tg" = mf{t : Lt = S}. His method allows us to 
calculate the functions h'^ and h~ as is shown in several examples. Furthermore, 
Vallois proves that {Bt^Tv : ^ > 0) is a uniformly integrable martingale if and 
only if /i has a first moment and is centered. 

The formulae obtained by Vallois are quite complicated. In the case of sym- 
metric measures they can simplified considerably as noted in our paper |84|. 

3.13. Perkins (1985) 

Perkins [201 investigated the problem of iJ^-embcdding and developed a solution 
to the Skorokhod problem that minimizes stochastically the distribution of the 
maximum of the process (up to the stopping time) and maximizes stochastically 
the distribution of the minimum. We will present his work in Section [6. II 

3.14. Jacka (1988) 

Jacka was interested in what can be seen as a converse to Perkins' goal. He 
looked for a maximal i?^-embedding, that is an embedding which maximizes 
stochastically the distribution of the maximum of the absolute value of the 
process. We describe his solution in Section 16.21 It seems that Jacka's work 
was mainly motivated by applications in the optimal stopping theory. In his 
subsequent paper, Jacka |59| used his embedding to recover the best constants 
Cp appearing in 



The interplay between Skorokhod embeddings and optimal stopping theory, of 
which we have an example above, is quite rich and diverse. We will come back 
to this matter in Section nTH 

3.15. Bertoin and Le Jan (1993) 

Tools used by Bertoin and Le Jan in "B* to construct their solution to the Sko- 
rokhod problem arc similar to those employed by Monroe The authors 
however do not rely on Monroe's work. They consider a much more general 
setting and obtain explicit formulae. 

Let E he a, locally compact space with a countable basis, and X a Hunt 
process on E, with a regular recurrent starting point G -E. The local time 
of X at is denoted by L. Authors use the correspondence between positive, 
continuous, additive functionals and their Revuz measures (see Revuz |98jV The 
invariant measure A which is used to define the correspondence is given by 



Tv = Tt A inf{t : B+ = h+{Lt) or h^{Lt)} 




(3.5) 



fdX 
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where rj is the characteristic measure of the excursion process of X, c is the 
delay coefficient* and C is the first hitting time of 0. 

Take a Revuz measure /x with /^({O}) = 0, < 1 and the positive 

continuous additive functional associated with it. Then T = inf{i : > Lt} 
embeds fi in X. However this solution is not a very useful one as we have 
KLt — oo. 

Consider an additional hypothesis: 

There exists a bounded Borel function , such that for every Revuz 
measure v: J V^diy — J ¥.x{A'^)diJ.{x). (3.6) 

Then, if /j, is a probability measure and 70 = ||Vu||cx), for any 7 > 70, the 
following stopping time embeds fi in X: 

r^LJ = inf > : 7^*(7 - MX.)r'dA': > Lt}. (3.7) 

Furthermore this solution is optimal in the following sense. Consider any Revuz 
measure v with A"^ not identically 0. Then Eq AJi-, = f (7 — Vn,)di' and for any 

solution S of the Skorokhod problem for /i, Eq Ag > J (70 — Vfj)dv. 

Authors then specify to the case of symmetric Levy processes. Let us inves- 
tigate this solution in the case of X a real Brownian motion. The function 
exists and its supremum is given by 70 = ||V^;^||oo — 2max{J x~^dfi, J dfi}. 
Hypothesis 13.61 is equivalent to fi having a finite first moment. In this case in- 
troduce 

^(^) _ / 2/„>,(a - x)d^iia) , for x > 
I 2/„<,(a;-a)dM(a) , for x < 0. 

Then 

TSIj = inf {i : 70 1 -^/f^i^) > ^?}- 

Note that if supp{fj,) C [a, 6] then p{a) — p{b) — 0, so in particular Tblj < Ta.b, 
the first exit time from [a, b]. 

3.16. Vallois (1994) 

In Section 13.121 above we described a solution to the Skorokhod embedding 
problem, based on Brownian local time, developed by Vallois in |118| . Here we 
present a different solution which Vallois described in |12()j . Actually, Vallois 
considered a generalization of the Skorokhod embedding problem. 

Let (Aft : t > 0) be a continuous local martingale with Moo ~ M- Vallois 
gave a complete characterization of the class of possible laws of Soo — supog 
(see also Section [lU.3|l . To prove his characterization he had to solve the following 
embedding problem: let /x G A^J and ni be a sub-probability measure on M+ 

^That is; the holding time at has an exponential distribution with parameter c. 
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such that jJL — is a positive measure on M_|_. For [Bt : < > 0) a Brownian 
motion, and St — sup„<ji3„, construct a stopping time T such that Bt ~ /i, 
Bt\{Bt=St} ~ /^i ^^'^ {BtAT : i > 0) is a uniformly integrable martingale. 
Vallois |12()| . showed that for each such couple there exists an increasing 

function a such that the stopping time 

T,,,^, = inf{i > : (Bt - a{St), St i T) or Bt - A}, (3.8) 

where T — {x ^ supp{pL) : a{x) — a;} and A is an independent random variable 
whose distribution is a specified function of and /ii, solves the embedding 
problem under consideration. 

Vallois' construction provides a link between two extreme cases: ^xi —0 and 
Ml = ^A[o,ao)■ In the first case, = 0, we just have T^^q = Tay, where Tay 
is the Azema-Yor stopping time defined via H5.3() . This case yields the maximal 
possible distribution of St (see Section The second case, = mI[o,oo): 

yields the minimal possible distribution of St, which coincides with S'tp, where 
Perkins' solution Tp is given by 

We believe that one could see Vallois' reasoning via one-dimensional po- 
tential theory but we were not able to do so. This is partially due to the fact 
that the stopping rule obtained is randomized. Vallois |120| in his proof used 
martingale theory, in particular he relied on the martingales displayed in (|5.11() . 

3.17. Falkner, Grandits, Pedersen and Peskir (2000 - 2002) 

We mention, exceptionally, several solutions in this one subsection, the reason 
being that they are all closely connected and subsequent ones were thought of 
as generalizations of the preceding. The solution developed by Azema and Yor 
PI works for recurrent real-valued diffusions. It is natural to ask whether this 
solution could be generalized for transient diffusions. 

Brownian motion with drift was the first transient diffusion considered. It 
was done by Hall back in 1969 as we stressed in Section however his work 
doesn't seem to be well known. Consequently, the subject was treated again in 
a similar manner by Grandits and Falkner and Peskir [^21 ■ They show that 
for a probability measure there exists a stopping time T such that B^p = 
Bt + &T ^ PL {5 > Q) ii and only ff / e-^'^'' dfi{x) < 1. The authors work with 

the martingale Yt = ex.p{—25Bl^'') — 1 rather than with the process B^^^ itself 
and they adapt the arguments used by Azema and Yor. Grandits and Falkner 
gave an explicit formula for T when J e~-^^^ dp{x) — 1, while Peskir obtained 
an explicit formula valid in all cases and showed that the solution maximizes 
stochastically the suprcmum of the process up to the stopping time. He also 
showed that letting 5 Q one recovers the standard Azema-Yor solution. The 
arguments used were extended by Pedersen and Peskir j89| . to treat any non- 
singular diffusion on R starting at 0. 

The methodology of these embeddings can be summarized as follows. Given 
a diffusion, compose it with its scale function. The result is a continuous local 
martingale. Use the Dambis-Dubins-Schwarz theorem to work with a Brownian 
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motion. Then use the Azema-Yor embedding and follow the above steps in a 
reversed direction. Some care is just needed as the local martingale obtained 
from the diffusion might converge. We develop this subject in Section |51 

3.18. Roynette, Vallois and Yor (2002) 

These authors |108j consider Brownian motion together with its local time at 
as a Markov process X, and apply Rost's results (see Section to derive 
conditions on /i - a probability measure on R x K_|_ - under which there exists 
an embedding of /x in X. They give an explicit form of the condition and of the 
solution whenever there exists one. 

Consider ^{dx, dl) = v{dl)M{l^ dx) a probability measure on R x R_|_, where 
V is a. probability measure on R+ and M is a Markov kernel. Then there exists 
a stopping time Trvy, such that (Btrvyj -^Trvy) ~ s-^d (-BtATavY ^ ^ > 0) is 
a uniformly integrable martingale if and only if 

1. /uxR+ \x\n{dx,dl) < oo, 

2. m{l) := J^x+M{l,dx) = J^x~M{l,dx) < oo, 

3. m{l)v{dl) ~ oo)(i/. 

Moreover, if v is absolutely continuous with respect to Lebesgue measure, then 
T is a stopping time with respect to Tt — u{Bs '■ s < t). 

An explicit construction of Trvy is given. It is carried out in two steps. 
First, a stopping time Ti is constructed which embeds u in L - it follows argu- 
ments of Jeulin and Yor |Snj. Then, using the Azema-Yor embedding, T > Ti is 
constructed such that conditionally on Lt^ ~ I, Bt ^ M{l,dx) and Bt has no 
zeros between Ti and T, so that Lt^ — Lt- 

This construction yields two interesting studies. It enables the authors in 
jl()8j to investigate the possible laws of Lt, where T is a stopping time such 
that {BtAT : < > 0) is a uniformly integrable martingale, thus generalizing the 
work of Vallois |12(J) . It also gives the possibility to look at the question of 
independence of T and Bt (primarily undertaken in |28| . |29p. 

3.19. Hambly, Kersting and Kyprianou (2002) 

These authors |51) develop a Law of Iterated Logarithm for random walks condi- 
tioned to stay nonnegative. One of the ingredients of their proof is an embedding 
of a given distribution into a Bessel(3) process. This distribution is actually the 
distribution of a renewal function of the initial distribution under the law of 
random walk conditioned to stay nonnegative. Authors use properties of the 
latter to realize their embedding. It is done in two steps, as they first stop the 
process in an intermediate distribution. Both parts of the construction require 
external randomization, independent of the process. 
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3.20. Cox and Hobson (2004) 

The work of Cox and Hobson can be seen as a generalization of Perkins' solu- 
tion (see Section r6.1(l to the case of regular diffusions and measures which are not 
necessarily centered. Indeed, consider any probability measure fi on M. The au- 
thors define two functions 7_ and 7-|_, which for a centered probability measure 
H coincide with the ones defined by Perkins. For a continuous local martingale 
(M() define its maximum and minimum processes through 5"*^ — sup^<j Mg 
and sf^ = — inisKt Ms. Then, given that {M,M)oo = oo, the stopping time 
Ten = inf{t > : Mt ^ (-7. (5*0, 7+(sf ))} embeds fi in M. Furthermore, it 
minimizes stochastically the supremum and maximizes stochastically the min- 
imum (cf. (|6.4() . H6.5I) '). The setup is then extended to regular diffusions using 
scale functions. For regular diffusions, scale functions exist and are continuous, 
strictly increasing. This implies that the maximum and minimum are preserved. 
Conditions on the diffusion (through its scale function) and on /i, for the exis- 
tence of an embedding, are given. 

Further, Cox and Hobson investigate the existence of iJ^-embeddings (see 
Section . Conditions are given which in important cases are necessary and 
sufficient. 

3.21. Obldj and Yor (2004) 

Almost all explicit solutions discussed so far dealt with continuous processes. 
The solution proposed by Bertoin and Le Jan (cf. Section r^. 15(1 is the only ex- 
ception, and even this solution is explicit only if we have easy access to additive 
functionals and can identify the function given by 1(3.6(1 . Stopping discontin- 
uous processes is in general harder, as we have less control over the value at the 
stopping time. Yet, in some cases, we can develop tools in discontinuous setups 
inspired by the classical framework. This is the case of an embedding for "nice" 
functionals of Brownian excursions, described in our joint work with Yor (84( . 
This solution was inspired by the Azema-Yor solution (see Section EJ, as we ex- 
plain below. For the sake of simplicity we will not present the reasoning found in 
Obloj and Yor [SJ in all generality but we will rather restrain ourselves to the 
case of the age process. We are interested therefore in developing an embedding 
for the following process: At = t — gt, where gt = sup{s < t : Bg — 0} is the 
last zero before time t. In other words. At is the age, at time t, of the Brow- 
nian excursion straddling time t. This is a process with values in M+, which is 
discontinuous and jumps down to at points of discontinuity. More precisely, 
we will focus on an equivalent process given by At = \J\{t — gt)- This process 
is, in some sense, more natural to consider, as it is just the projection of the 
absolute value of Brownian motion on the filtration generated by the signs. 

Starting with the standard Azema-Yor embedding (see Section [^?T|l . let us 
rewrite their stopping time using Levy's equivalence theorem. 

Tay i^^{t>Q:St<-^t,{Bt)} = ini{t>Q:St--^-\St)>St~Bt} 

inf{i > : *~^(Lt) > \Bt\}, where = Id- (3.9) 
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The idea of replacing the supremum process by the local time process comes 
from the fact that the latter, unlike the former, is measurable with respect to 
the filtration generated by the signs (or by the zeros) of Brownian motion. Even 
though the function VP^ is in general hard to describe, the above formula suggests 
that it might be possible to develop an embedding for the age process of the 
following form: T = mi{t > At > (p{Lt)}, where the function (p would be a 
function of the measure n we want to embed. This proves to be true. For any 
measure /z on R+ with ^({0}) = 0, let [a, b] be its support, Jlix) = fi{[x, oo)) its 
tail, and define the dual Hardy-Littlewood function^ 

i'tj.i^) = / =rTC^M(y), for a < X < 6, (3.10) 
J[o,x] IJ'[y) 

— for < X < a and tp^j,{x) = oo for x > b. The function tp^ is right- 
continuous, non-decreasing and < for all x < b. We can then define its 
right inverse ip^ — Then the stopping time 

Toy =inf{t > : > ^^{Lt)} 

embeds /i in A, Atq^ ~ 

The proof is carried out using excursions theory (even though a different 
one, relying on martingale arguments, is possible too). The crucial step is the 
calculation of the law of Ltq-y-j which we give here. Note (t;) the inverse of the 
local time at 0, (e;) the excursion process and V{ei) — Ar,- the rescaled lifetime 
of the excursion. 

F(iToY > = HTo^ > n) 

= P ^on the time interval [0, ti] for every excursion Cs, s < Z, 
its (rescaled) lifetime did not exceed 1^9^(5)^ 

= Kg - 0) - -P ( - / ^) ' (3-11) 

where the last equality follows from the fact that the process 
Ni = X]s<i 1-F(es)>v(s)' ^ > 0, is an inhomogeneous Poisson process with ap- 
propriate parameter. The core of the above argument remains valid for much 
more general functionals of Brownian excursions, thus establishing a method- 
ology of solving the Skorokhod embedding in a number of important setups. 
In particular, it yields an embedding for the absolute value {\Bt\ : t > 0), giv- 
ing the explicit formula in the Vallois solution (see Section r3.12fl for symmetric 
measures. 



''Compare with the Hardy-Littlewood function used by Azema and Yor given in 
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Table 1: Genealogy of solutions to the Skorokhod embedding problem 
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4. Embedding measures with finite support 

We continue here the discussion of Chacon and Walsh's solution from Section 
13.81 The target measure fi, which is a centered probability measure on R, is 
supposed to have a finite support, i.e.: ^ = X]r=i '^i^xi^ where > 0, J2i=i '^i — 
1 and X]r=i Q^i^i — 0, Xi ^ Xj for I < i =/= j < n. Recall the potential U ^ given in 
Definition l2.2l In the present situation, it is a piece-wise linear function breaking 
at the atoms {xi, . . . ,Xn}- It is interesting that in this simple setting several 
solutions are easily seen to be special cases of the Chacon- Walsh solution. 

4.1- Continuous local martingale setup 

Let (Mt) be a continuous local martingale, Mq — 0, with (Af, M)oo = oo a.s. 
and consider the embedding problem of /i in M. First we can ask: how many 
solutions to the embedding problem exist? The answer is simple: except for the 
trivial case, an infinite number. Recall that n is the number of points in the 
support of fi. We say that two solutions T and S that embed fi are different if 
P(T = S*) < 1. 

Proposition 4.1. Ifn>3, then there exists uncountably many stopping times 
T which embed jj,: Mt ~ A*, and such that (Mt/^r) is a uniformly integrable 
martingale. 

Proof. This is an easy consequence of the Chacon and Walsh embedding de- 
scribed in Section 13.81 We describe only how to obtain an infinity of solutions 
for n = 3, the ideas for higher values of n being the same. Our reasoning is 
easiest explained with a series of drawings: 



fli bi 




Note that xi = 02, X2 = 03 and x^ — 63. Consider the set of paths to which 
are stopped in 02 — xi: At — {uj : M{ijj)t{u)) — It can be described as 
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the set of paths that hit ai before bi and then 02 before &2 ■ It is clear from our 
construction that we can choose these points in an infinite number of ways and 
each of them yields a different embedding (paths stopped at Xi differ). Indeed 
as for any oi < oi < < 62 < 62 < P(Tki < < < T,, J > 0, the result 
follows from the fact that if F(Bt 7^ Bs) > then P(5 = T) < 1. □ 

However in their original work Chacon and Walsh allowed only to take lines 
tangent to the graph of U fi. In this case we have to choose the order of drawing 
n — I tangent lines and this can be done in (n — 1)! ways. Several of them are 
known as particular solutions. Note that we could just as well take a measure 
with a countable support and the constructions described below remain true. 

• The Azema-Yor solution consists in drawing the lines from left to right. 
This was first noticed, for measures with finite support, by Meilijson |73| . 
The Azema-Yor solution can be summarized in the following way. Let X ~ 
/i. Consider a martingale Yi = E[A|lx=2;i, ■ • ■ , 1-x=xi], Yq — 0. Note that 
Yi has dichotomous transitions, it increases and then becomes constant 
and equal to X as soon as it decreases, Y„_i = X. Embed this martingale 
into {Mt) by consecutive first hitting times. It suffices to remark now 
that the hitting times correspond to ones obtained by taking tangent lines 
in Chacon and Walsh construction, from left to right. Indeed, the lower 
bounds for the consecutive hitting times are just atoms of /i in increasing 
order from xi to Xn-i- 

Another way to see all this is to use the property that the Azema-Yor 
solution maximizes stochastically the maximum of the process. If we want 
the supremum of the process to be maximal we have to take care not 
to "penalize big values" or in other words "not to stop too early in a 
small atom" - this is just to say that we want to take the bounds of our 
consecutive stopping times as small as possible, which is just to say we 
take the tangent lines from left to right. 

Section [S] is devoted to the study of this particular solution in a general 
setting. 

• The Reversed Azema-Yor solution consists in taking tangents from 
right to left. It minimizes the infimum of the stopped process. We discuss 
it in Section Is. 31 

• The Dubins solution is always a particular case of the Chacon and Walsh 
solution. In a sense it gives a canonical method of choosing the tangents. 
We described it in Section ITSl 

4-2. Random walk setup 

When presenting above, in Section 13.81 the Chacon and Walsh solution, we 
pointed out that it only takes advantage of the martingale property of Brown- 
ian motion. We could ask therefore if it can also be used to construct embeddings 
for processes with countable state space. And it is very easy to see the answer 
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is affirmative, under some hypothesis which guarantees that no over-shoot phe- 
nomenon can take place. 

Let (M„)„>o be a discrete martingale taking values in Z with Mq ~ a.s. 
Naturally, a symmetric random walk provides us with a canonical example and 
actually this section was motivated by a question raised by Fujita j44| concerning 
precisely the extension of the Azema-Yor embedding to random walks. 

Denote the range of M by i? = {z € Z : inf{n > : Af„ = z} < oo a.s.} and 
let fihe a centered probability measure on R. It is then an atomic measure^. We 
can still use the Chacon and Walsh methodology under one condition. Namely, 
we can only use such exit times Ta,b that both a, 6 G R. Otherwise we would 
face an over-shoot phenomena and the method would not work. 

Naturally in the Azema-Yor method, the left end points, (ofe), are just the 
consecutive atoms of /z, which by definition are in R. The right-end points are 
determined by tangents to Uji as is described in detail in Section 15.11 below. 
They are given by 4'^(ofe), where is displayed in H5.2() . and they might not 
be in R. 

Proposition 4.2. In the above setting suppose that the measure fi is such that 
\E'p(]R) C R, where is given by 1^5. Then the Azema-Yor stopping time, 
T = inf{n > : S'„ > ^p(Af„)}, where 5„ — niaxfc<„ M^, embeds the measure 
fi in M, Mt ^ 

The same Proposition, for a random walk, was also obtained by Fujita j44| . 
He used discrete martingales, which are closely related to the martingales used 
by Azema and Yor ,3 in their original proof, and which we discuss in Section 

We close this section with two observations. First, notice that the condition 
^'^(M) C i? is not necessary for the existence of a solution to the Skorokhod 
embedding problem. We give a simple example. Consider a standard, symmetric 
random walk, (X„ ; n = 0,1,...), and take = |(5-3 -I- + ^^2- Then 
\E'p(0) = I ^ Z, but one can easily point out a good embedding. It suffices to 
take tangents to Uii from right to left, that is to consider the reversed Azema- 
Yor solution. Then the stopping time T = inf{n > T_i^2 : Mn ^ [—3, 0]} embeds 
/i in M . 

Finally, it is not true that for any centered discrete probability measure //, 
there exists a stopping time T, in the natural filtration of X, such that Xt ~ A* 
and {XnAT : n > 0) is a uniformly integrable martingale. This was observed by 
Cox Plj. It suffices to take = ^{S-i+Sq + Si) (one can also impose A*({0}) = 
and still find easy examples). Therefore, for some measures, a solution to the 
Skorokhod embedding problem for random walk has to rely on an external 
randomization. This subject requires further investigation. 

5. Around the Azema-Yor solution 

In this section we present various results concerning the solution proposed by 
Azema and Yor j3|. We first provide a very simple proof of the solution in 

^The support is not necessary finite but this is not crucial for the method, so we decided 
to place this discussion in this section. 
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Section [Ql The proof is based on the methodology of Chacon and Walsh (see 
Section l3.8() and allows us very easily to generalize the Azema-Yor embedding 
for the case of nontrivial initial laws. This is done in Section 15.21 The proof 
encouraged us too to give, in Section|^21 another, reversed, form of the solution. 
In subsequent Section [5.41 we explain the original ideas, based on martingale 
theory, which stood behind the work of Azema and Yor. This allows us, in 
Section to prove easily certain extremal properties of this solution. 

We give first a brief history of this solution. It was presented in the 13*^ 
"Seminaire de Probabilites" in |2| together with a follow up [2] , which examined 
some maximizing property (see Section 15.51 below 1 of the solution. The original 
arguments are also found in Revuz and Yor (|5HI pp. 269-276). The construction 
in [3j was carried out for a continuous local martingale and extended easily to 
recurrent, real- valued diffusions. The original proof was then simplified by Pierre 
inSj and Vallois |118j . A proof using excursion theory was given by Rogers 
An elementary proof was given by Zaremba |126| . This work jl26| not only gives 
a "bare-handed" proof, but doing so it actually provides a convenient limiting 
procedure that allows us to see that Azema and Yor's solution is, in a sense, a 
special case of Chacon and Walsh's solution (see also Section 0J. The Azema- 
Yor solution has been generalized by several authors - we described their papers 
together in Section !^. 171 Finally, using Levy's equivalence {S — B, S) ~ {\B\,L), 
works using the local time at to develop a solution to the embedding problem 
can be seen as generalizing Azema and Yor's construction, and we mention 
therefore articles by Jeulin and Yor 'FO' and by Vallois |118| (see Section [3. 12|) . 

5.1. An even simpler proof of the Azema-Yor formula 

In this section, for simplicity of notation, we write "AY solution" for Azema 
and Yor's solution. 

We saw in Section 0] how the AY construction for a measure with finite 
support is just a special case of the method described by Chacon and Walsh, 
and consists in taking as tangent lines the extensions of the linear pieces of the 
potential from left to right. In this section we will see how to generalize this 
idea. 

Let fj, he a centered probability measure on R and x a point in the interior 
of its support, such that fi has no atom at x: fj,{{x}) — 0. Recall the one- 
dimensional potential given in Definition 12.21 We are interested in the tangent 
line Ix to U ^ at {x, Ufi{x)) and its intersection with the line {{t, —t) : t G M}. 
Denote this intersection by (^'^(a;), — ^^(x)). The following drawing summa- 
rizes these quantities: 
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It is straightforward to find the equation satisfied by Ix- Its slope is given by 
\ t=x and so we can write = {{x,y) : y = "^^^^^^ \ t=x ■ x + P}. Differenti- 
ating H2.3|l we obtain 



dUn{t) 
Jt 



^ 2/i([i, oo)) - 1 - 2tijL{dt) + 2tfj.{dt) 

- 2^([i,oo))-l. (5.1) 



We go on to calculate /3. Using the fact that (a:, Ufi{x)) € Ix together with (|2.3|l 
and H5.1|l . we obtain /3 — —2 J^^ yd^{y). We are ready to determine the value 
of ^{x) writing the equation of the intersection of Ix with {(i, —t) : < G M}: 



-\i> 



{x) = (2ijl{[x, oo)) - 1)) ^^,{x) -2 ydfiiy), hence 

^ ^ Jlx.,oo) 



1 



fj,{[x,oo)) J,^ 



^^{x) = ^ / ydfi{y), (5.2) 



which is just the barycentre function defined by Azema and Yor, which is also 
called the Hardy-Littlewood (maximal) function. We still have to deal however 
with atoms of fi. Clearly in atoms the potential is not differentiable and the 
tangent is not unique. It is easy to see from the construction of 'i'^{x) that it 
is a non-decreasing, continuous function (for x with iJ,{{x}) = 0). Fix an x such 
that /x({x}) > 0. To assign value to ^'^(a;) we follow the case of measures with 
finite support described in Section^ We see that, as a consequence of taking 
the tangents from left to right, we treat the point (x, Ufi{x)) as belonging rather 
to the linear piece of the potential on the left than the one on the right. In other 
words we take to be left continuous. The formula (|5.2|l remains valid for all 
X in the interior of support of /i then. 

We want finally to describe the function outside of the support of /x 
(and on its boundaries). Suppose that the support of fj, is bounded from below: 

= infja; : x € supp{fj,)} > —oo. For all x < we have simply that U fJ.{x) = x 
and so the tangent is just the line {{t,t) : t E M}, which intersects the line 
{{t, -t):te K} in (0, 0), so we put ^'^(x) = 0. 

Suppose now that the support of /i is bounded from above: 6^ = sup{x : 
X G supp{p)} < oo. Then for all a; > 6^ we have U^{x) = —x and so the very 
point {x,Ufj,{x)) lies on the line {(i, — i) : t € R} and we put '^fj,{x) = x. This 
completes the definition of the function 'I'p, which coincides with the one given 
in0. 
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We want now to define the stopping time that results from this setup. At 
first glance we could have a small problem: following closely the case of measures 
with finite support would end up in a composition of an uncountable infinity of 
first hitting times of intervals (a;, \E'^(a;)) for x in the support of /j,. This would 
not make much sense. However there is an easy way to overcome this difficulty 
- we can use the fact that is increasing. Let {Mt : t > 0) be a continuous 
local martingale and denote its supremum by St = sup{Ms : s < t}. St then 
describes which values of type '^p.ix) have been reached so far, and so which of 
the intervals {x, \l/^(a;)) we are considering. We want to stop when exiting such 
an interval on its left boundary, i.e. when we reach an x such that the supremum 
so far is equal to ^^(a;). In other words we take: 

Tay = inf{t > : 5t > *p(Aft)}- (5.3) 

Theorem 5.1 (Azema and Yor (Sj). Let {Mt)t>o be a continuous martingale, 
Mq = a.s., with {M,M)ao = oo a.s. For any centered probability measure fj, 
on R, Tay defined through l^./jj) . is a stopping time in the natural filtration 
of (Mt) and Mt^-^ ^ A*- Furthermore, {MtATA-Y)t>o is a uniformly integrable 
martingale. 

Proof. From H5.3|l it is evident that Tay is a stopping time in the natural 
filtration of {Mt). The proof of the remaining statements is carried out through 
a limit procedure using approximation of U ^ by piece-wise linear functions. We 
define measures /i„ through their potentials. Let A ~ {ai, 02 ... } be the set of 
atoms of /i (i.e. /i({a;}) > iff 3! i : a; = a^) and Pn — ■ k ^ Z}. Define /.t„ 
through: 

• Vx e {ylUP„}n [-n,n] C/^„(x) = [/^(a;), 

• U fin is piece-wise linear, //„((— 00, — n — 1] U [rt + 1, 00)) = 0. 

It is straightforward that ?7/i„(a;) Ufi{x) for all x G M, which in turn implies 
fin ^ fJ-- For each of the measures fin consider the AY stopping time T„, which 
embeds /i„ as shown in Section^ Finally from the definition of a tangent, we 
see that lines defining Tn converge to tangents of Ufi and Tn T = Tay almost 
surely. Then Mt„ — > Mt a.s. and since fin fi, we obtain Mt/^^ ^ fi. To see 
that (MjaTay '■ t > 0) one can use assertion (v) of Proposition and the fact 
that ETn = J x'^dfinlx) < 00. □ 

It has to be noted that what we did in this section is essentially a simplified 
reading of Zaremba's work |126 | - simplified through the work of Chacon and 
Walsh. 

5.2. Nontrivial initial laws 

One of the advantages of the approach presented here is that we impose only 
one condition on the initial law: namely that its potential is greater than the 
potential of the measure fi we want to embed (and this is known to be a necessary 
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condition, see Section IS)). We can still give an explicit formulae for the stopping 
time Tay, for nontrivial initial laws. 

Let {Mt : i > 0) be a continuous, local, real- valued martingale with {M, M)^ - 
oo a.s. and with initial law Mq ^ v, J \x\di'{x) < oo. Let /i be any probability 
measure on K with finite first moment, i.e. ^ G M.^, such that Ufi < Uv. Using 
assertion (ii) from Proposition 12 . '6\ this implies J xdy{x) = J xdii{x) — m, and 

lim {Uv{x) + |a; — m|) — lim {Ufi{x) + |a; — m|) = 0. (5-4) 

\x\ — *oo — ^oo 

Then fi can be embedded in M by a stopping time, which is a generalization of 
Azema and Yor's solution. 

Consider a tangent to the potential Uii at a point [x, U ^{x)) (which is not 
an atom of jj) . We want to describe the points of intersection of this line with 
the potential of v. Studying (|2.3|l . H5.1|l and H5.2|l it is easily seen that the points 
are of form (t, Uv{t))^ where t satisfies 

Uv{t) ^ [^^{[x.oo)) - ^i{{-(x,x))\t+ I ydfi{y) ~ ydfj.{y). (5.5) 

We want to define a generalized barycentre function '^'^(x) as a certain solution 
of the above and the stopping time will be given as usual through 

TX^ = inf{t >0:St> ^;iBt)}. (5.6) 

Let us look first at a simple example. Let i? be a Brownian motion starting from 
zero and v G A4q be concentrated on a and b, in other words u ~ Bt^ ^. Then 
/z is also centered as Ufi < Uv. Let = mi{t : t € supp{^i)} < a be the lower 
bound of the support of ^ and be the standard barycentre function defined 
through H5.2|l . Writing the explicit formula for Uv in H5.5|l we obtain 

1 , for X < flp 

Ki-) = < ^'-^'^'r;f„^ ,fora,<.<5 
'i'fj.ix) , for X > b. 

Consider now the general setup. The equation (|5.5I) has either two solutions 
t]. < or solutions form an interval t G t~ < , see Figure 1. Let us 

denote these two situations respectively by / and //. The latter happens if and 
only if the two potentials coincide in a;. Finally, for a probability measure p let 
Op = iid{t : t G supp{p)}, bp = sup{a; : x G supp{p)} be respectively the lower 
and the upper bound of the support of p. Define: 



+2 



for X < 



^ { ' for a^<x<b^iil (5.7) 

X for (a^ < X <b^ if //) and for {x > b^j). 

A simple observation is required to see that the definition agrees with the intu- 
ition. Since Up < Uv < U5m, and Up and Uv coincide with U6m outside their 
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Figure 1: Possible intersections of tangents for nontrivial initial law. 



supports, the support of is contained in the support of /i, [aj,,5i^] C [a^,b^]. 
This grants that ^'^^ix) > x. 

It is also worth mentioning that the function ^fj;^ lost some of the properties 
characterizing 'I'p. In particular we can have '^^^(x\) x\ and '^^^(xi) > X2, for 
some X2 > xi. 

Proposition 5.2. Let {Mt : t > 0) be a continuous, real-valued, local martingale 
with {M, M)oo = oo, Mo ~ v, and fi e with Ufi < Uv. Define and T^^ 
through \5.1\j and i)5.6|) respectively. Then {M^^rp^^'^ is a uniformly integrahle 
martingale and Mrp^^ ~ fi. 

The proof of this result is analogous to the proof of Theorem l5.ll taking into 
account the above discussion of '^f,. 

We remark that the construction presented above might be used to mix or 
refine some of the known solutions to the embedding problem. For example if 
a, a]) = we can first stop at T^a.a and then use with u = ^{S-a + ^a)- 

5.3. Minimizing the minimum - the reversed AY solution 

In Sect ion [Em we saw how to derive Azema and Yor's solution to the Skorokhod 
problem using potential theory. It is easy to see that the same arguments allow 
us to recover another solution, which we call the reversed Azema- Yor solution. 
It was first studied by van der Vecht |121| (see also Meilijson [731)- Recall that 
Ix was the tangent to the potential Ufi in {x,Ufi{x)). We considered its inter- 
section with the line {{t,~t) : t £ R}, which was denoted (^'^(2:), — ^^(a::)). 
Consider now the intersection of Ix with the line {{t,t) : t £ M} and denote it 
by (6^(a;), 6^(a;)). A simple calculation (analogous to the derivation of ()5.2|l 
above) shows that: 

Q^i^) = -jT^ — / ydii{y). (5.8) 

/i((-oo,a;)) J(_oo,a:) 

This is a valid formula for x which is not an atom. Looking at the discrete case 
we see that we want to be right continuous, so we finally need to define: 

e^(x) = — ^ ^ / yd^^iy). (5.9) 
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Denote St — — mf{Ms s < t} the infimum process and define the following 
stopping time 

TrAY - inf{i >0:~st< O^iMt)}. (5.10) 

Corollary 5.3. Let {Mt)t>Q be a continuous local martingale, Mq — a.s., with 
(Af, M)oo — oo a.s. and let /i be a centered probability measure on M. Define 
through 1 5. .9)) . Then T^ay is a stopping time and MtATrA-v ^ Furthermore, 
(MtATrAY '■ t > 0) is a uniformly integrable martingale. As a consequence, if 
J^x^d^{x) — V < oo then ETrAY = v. 

We called this a corollary as this solution is still an Azema-Yor solution. 
Indeed if X ~ /i then TrAY is just Tay for v the law of —X and the continuous 
local martingale Nt = —Mt. This shows also that we could handle non-trivial 
initial laws for the reversed Azema-Yor solution. Speaking informally, in the 
original Azema-Yor solution one takes tangents from left to right and in the 
reversed version one takes the tangents from right to left. We will see below 
that it yields some symmetric optimal properties. 

5.4- Martingale theory arguments 

In the preceeding, we chose to discuss the Azema-Yor solution using one-dimensional 
potential theory. This methodology allowed us to understand well what is re- 
ally being done and what is really being assumed, opening itself easily to some 
generalizations, as we saw above and as we will see again in Section One 
can choose however a different methodology, which has its own advantages as 
well. We sketch now the heart of the proof that goes through martingale theory. 
These are the ideas that actually led to the original derivation of the solution 
in 0. 

Let / : [0,oo) — > M be an integrable function and denote its primitive by 
Fix) = f{y)dy. Then 

nI = F{St) - {St - Mt)f{St), t > (5.11) 

is a local martingale, where M and S are as in the previous section. This can 
be easily seen when / is continuously differentiable, since integrating by parts 
we obtain 

Nf = F{St) - f fiSu)dSu + f f{Su)dMu - f {Su^ Mu)f'{Su)dSu 
Jo Jo Jo 

= F{St)-F{St)+ f f{Su)dMu= f f{Su)dMu, (5.12) 
Jo Jo 

as {Su — My) is zero on the support of dSu- The result extends to more general 
functions / through a limit procedure^. Given a centered probability measure 
/X, we want to find T, which embeds /x, of the form T ~ mi{t > '■ St > 

^It can also be verified for / an indicator function and then extended through the monotone 
class theorem. For more on martingales of the form 115.111 see 1851 
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\E'(Aft)}. Our goal is therefore to specify the function ^. Suppose we can apply 
the optional stopping theorem to the martingale and the stopping time T. 
We obtain 

EF{St) = E(5t - MT)f{ST). (5.13) 

Note that Mt ~ p and St = 5" (My). We suppose also that ^ is strictly 
increasing. If we denote by p the law of St and p = p{[x,~\-oo)) its tail, the 
above equation can be rewritten as 

F{x)dp{x) = j {x — '^^^{x))f{x)dp{x), integrating by parts 

f{x)'p{x)dx = I {x — "^^^ {x)) f {x)dp{x) , which we rewrite as 



dpix) = \ dx, since / was a generic function . 

X — ^'"^(a;) 

We know that St = ^'(Mt), so that ]l{x) = F{Mt > x) = ¥{St > *(a;)) 
p(^(a;)). Inserting this in the above yields 

'i'{x)d4l{x) — xdjlix) = —'p{x)d^{x) 
^{x)d4i{x) +'p.{x)d-^{x) = xdji{x) 

d(^{x)'Jl{x)^ = xdp{x) 

*(a;)7l(a;) = / ydp{y), 

J [a;,+oo) 

and we find the formula (|5.2|l . 



5.5. Some properties 

The Azema-Yor solution is known to maximize the supremum in the following 
sense. Let p, M and S be as in the previous section, and R any stopping time 
with Mb^ ^ p and (Af/j^t : i > 0) a uniformly integrable martingale. Then for 
any z > 0, P(5'i{ > z) < V{Sta:y ^ This was observed already by Azema 
and Yor [2] , as a direct consequence of work by Dubins and Gilat , which in 
turn relied on Blackwell and Dubins We will try to present here a unified 
and simplified version of the argumentation. The property was later argued in 
an elegant way by Perkins ,90) and it is also clear from our proof in Section [5. II 
(as stressed when discussing measures with finite support, see page KMOjl . 

For simplicity assume that p has a positive density, so that 'p~^ is a well 
defined continuous function. Let f(x) = l[z.+oo) (2^)1 then = {St — z)^ — 
{St — Mt)l[z,+oo){St) is a martingale by H5.11|l . Applying the optional stopping 
theorem for {R A n) and passing to the limit (using the monotone convergence 
theorem and the uniform integrability of {Mp^^t : t > 0)) we see that zf{STi > 
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z) = E, AIbI [z. +00) (Sr)- Denote P(5ij > z) = p. It suffices now to note, that 

zp = 

z < 
P < 

which we wanted to prove. Note that the passage between the last two hnes is 
justified by the fact that ^E*"^ is increasing and JI is decreasing. 

This optimaUty property characterizes the Azema-Yor solution. Further- 
more, it characterizes some of its generalizations - for example in the case of 
Brownian motion with drift, see Peskir 9'S . It also implies an optimality prop- 
erty of the reversed solution presented in Section [5.31 Namely, with R as above, 
we have P(sij > z) < P(st^^y > z), for any z > 0, where St = — inf,tj<t Mu- 

The second optimality property says that this solution is pointwise the small- 
est one. More precisely, if for any stopping time S < Tay and Ms ~ Mt^^ then 
S — Tay a.s. Actually, it is a general phenomena that we will discuss in Section 
IHl along with some other general properties of stopping times. 

6. The i?^-embedding 

Given a solution T to the Skorokhod stopping problem we can ask about the 
behavior of supremum or infimum of our process up to time T. This question 
has attracted a lot of attention. In particular some solutions were constructed 
precisely in order to minimize the supremum and maximize the infimum and 
conversely. 

Let us denote by (Xt) the real valued process we are considering and fi a 
probability measure on the real line. The following problem: 

Find some necessary and sufficient conditions on (A", fi) for the existence of 
a solution T to the Skorokhod embedding problem such that = sup{|As| : 
s < T} is in L^, 

is called the iJ^-embedding problem. We will restrain ourselves to the Brow- 
nian setup, even though most of the ideas below work for real-valued diffusions 
on natural scale. We will present a solution of the iJ^-cmbcdding by Perkins. He 
actually gives a solution to the Skorokhod embedding problem, which minimizes 
X^. We then proceed to Jacka's solution, which is a counterpart of Perkins, as it 
maximizes X^. We mention that several authors have investigated in detail the 
behavior of the maximum of a uniformly integrable martingale and we gathered 
theirs works in Section riU.3l 

6.1. A minimal -embedding 

The problem of iJ^-embedding was also considered by Walsh and Davis [?7j 
but received a complete and accurate description in Perkins' work in 1985 
Using Doob's L^'-inequalities one can see easily that for p > 1 an iJ^'-embedding 



xdfi{x) ^ p'ifj^ijl^^ip)), thus 



^(p))' which yields 
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exists if and only if J \x\Pd^{x) < oo, and if p < 1 every embedding is in fact 
an 7?''-enibedding. For p = 1 the situation is more delicate, and it goes back to 
an analytical problem solved by Cereteli |^ . 

Theorem 6.1 (Cereteli-Davis- Perkins). Consider Brownian motion and a 
centered probability measure fi. The following are equivalent: 

• There exists an H^-embedding of ji. 

• The Skorokhod solution, described in Section ]!^. 1[ is an -embedding of 

• H{^i) ^ J^xl\^i>xdfi{x)\dX < oo. 

Perkins [3D| associated with every centered probability measure two func- 
tions: 7_ and 7+ (see the original work for explicit formulae). Then if 

St — sup-Bu and st = ~ inf i?„ (6-1) 

u<t "<* 

are respectively the supremum and infimum processes we can define: 

p = M{t>0:Bt^{-^+{St),j^{stm. (6.2) 

If the target distribution has an atom in 0, we just have to stop in with the 
appropriate probability. This is done through an external randomization. We 
take an independent random variable U, uniformly distributed on [0, 1] and put 

Tp = plt/>^({o}), (6.3) 

where P stands for Perkins. Tp is a (randomized) stopping time that embeds /x 
and Tp is an i?^-embedding if there exists one. Moreover, Perkins defined his 
stopping time in order to control the law of the supremum and the infimum of 
the process and his stopping time is optimal. 

Theorem 6.2 (Perkins). {Bt^Tp '■t>Q) is a uniformly integrable martingale 
and Btp ~ /i. Tp is an H^-embedding if and only if H{fi) < 00. Furthermore, 
for any solution T of the Skorokhod embedding problem for /i, for all A > 0, 

P(5'tp > A) <P(S't > A), (6.4) 
P(sTp > A) <P(sT > A), (6.5) 
^B'^p > A) < P(B^ > A). (6.6) 

Actually this theorem is not hard to prove - the stopping time is constructed 
in such a way as to satisfy the above. The hard part was to prove that Tp embeds 
11. The construction of Tp helps Perkins to give explicit formulae for the three 
probabilities on the left side of H6.4(l - (|6.6(l . Using Perkins' framework we could 
also define a different stopping time in such a way that it would maximize 
stochastically the supremum 5*. Of course what we end up with is the Azema- 
Yor stopping time (see Sections l5 . II and f5 . 5|l and this is yet an another approach 
which allows us to recover their solution in a natural way. 
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The work of Perkins has been generalized by Cox and Hobson ■ Several 

authors examined the law of supremum (and infimum) of a martingale with a 
fixed terminal distribution (see Section Fl (J. 3|l . More generally, the link between 
the Skorokhod embedding problem and optimal stopping theory was studied. 
We develop this subject in Section EH 

6.2. A maximal -embedding 

We saw in Sectional that the Azema-Yor construction maximizes stochastically 
the distribution of St, where Bt ~ H, fixed, in the class of stopping times such 
that {Bt/\T)t>o is a uniformly integrable martingale. Moreover, we saw also that 
its reversed version T^ay, minimizes the infimum. How can we combine these 
two? The idea came from Jacka and allowed him to construct a maximal 
embedding. We will describe his idea using the Chacon and Walsh methodology, 
which is quite different from his, yet helps, in our opinion, to see what is really 
being done. 

We will continue to work within the Brownian setting, even though ev- 
erything works in exactly the same way for continuous local martingales. We 
want to find a stopping time T such that Bt ^ fJ., {BtAT : t > 0) is a uni- 
formly integrable martingale and if R is another such stopping time, then 
^(-B^ > z) < P(-Bt > z), for z > 0. Naturally big values of B* come ei- 
ther from big values of S or s. We should therefore wait "as long as possible" 
to decide which of the two we choose, and then apply the appropriate extremal 
embedding. To see whether we should use big negative or big positive values 
we will stop the martingale at the exit time from some interval [— fc, fc], which 
we assume to be symmetric. This corresponds to cutting the initial potential 
with a horizontal line. The biggest value of k which we can take is obtained by 
requiring that the horizontal line be tangent to the potential C//n. 

For sake of simplicity we suppose that the measure ^ has a positive density. 
We need to determine the point x* in which the derivative oi becomes zero. 
Looking at H5.1(l we see that ]l{x*) — ^. In other words, we chose the median of 
II for X* . The value of k* is then given as the intersection of the tangent to the 
graph of Ufi in x* with the line y = —x, which is precisely k* — f^{x*), where 
^ is given by H5.2|l . 

As suggested, we first stop at the exit time from [—k*, k*]. Obviously Brow- 
nian motion stops either at fc*, in which case it will be finally stopped in- 
side [J7/i(a;*), oo), or at —k* in which case it will be finally stopped inside 
(— cxD, [//i(a;*)]. If we exit at the top, we want to maximize the supremum, and 
so we will use the standard Azema-Yor embedding: Ti — inf{t > T-k,k '■ St > 
fj.iBt)}- If we exit at the bottom, we need to minimize the infimum, so we will 
use the reversed Azema-Yor solution: T2 = inf{i > T_k^k ■ ~St < 0^(-Bt)}. We 
have then the following proposition. 

Proposition 6.3 (Jacka j58p . In the above setup 



F (S^ > z) < min {l,M([V';'(^), +00)) + /.((-cx), Q-'iz)])}. (6.7) 
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Furthermore, this inequality is optimal as equality is attained for a stopping time 
Tj defined through 

Tj = Ti AT2 = inf {i > T_fc.,fc. : B; > niax(*^(SO, e^(Bt)}- (6-8) 

If the measure /i has atoms the above Proposition is stiU true taking left in- 
verses of VE*^ and and extending correctly the definition of k* . More precisely, 
if we write m = inf{x : Jl{x) > i} for the median of /i, then if /i({m}) > we 
have a family of tangents to U /i in m. The value ^'^(m) results from taking the 
left-most tangent (cf. Section WTT^i and not the horizontal one. We have there- 
fore k* > \E'^(to). The difference is given by the slope of the left-most tangent, 
{2jl[m) — 1), times the increment interval (^'^(to) — m). We obtain therefore: 
k* = 4'^(m) -|- {2jl{m) — l)(^'p(m) — to), which is the quantity given in Lemma 
6 in Jacka [SB]. 

7. Root's solution 

All the solutions described so far have one thing in common: stopping times are 
defined through some functionals of Brownian motion up to time T. Intuitively, 
if one searches for a stopping time with minimal variance it should have a 
very simple history-dependence structure, preferably it should just be the first 
hitting time of some set. Of course this cannot be just a subset of the space 
R but rather of the time-space R+ x R. Root |103j showed that indeed for any 
centered probability measure on M with finite second moment, there exists a 
set C R+ X M, such that the first entrance time of {t,Bt) into i?^, embeds 
fi in Brownian motion B. The first hitting times for the space-time process of 
Brownian motion correspond to reduites of potential measures and we should not 
be surprised that to investigate Root's solution Rost |ll)4 |. | 1U5| . |1U6| needed a 
fair amount of potential theory. 

7. 1 . Solution 

The key notion is that of a barrier. 

Definition 7.1. A subset R of [0, +00] x [—00, +00] is a harrier if 

1. R is closed, 

2. (-foo, x) E R for all x G [—00, +00], 

3. {t, ±00) £ R for all t G [0, +00], 

4- if {t, x) £ R then [s, x) £ R whenever s > t. 

Theorem 7.2 (Root [103J). For any probability measure ^ onR with J x^d^(x) — 
V < 00 and J xdfj,{x) — 0, there exists a barrier Rf^ such that the first hitting 
time of the barrier 

= Tb., - inf{t > : (t, Bt) G i?^}, (7.1) 
solves the Skorokhod problem for fj,, i.e. Bt^^^ ^ fJ- and IETr^ = v. 
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Remark on notation: We write for Root's stopping time, which solves the 
Skorokhod embedding problem for the measure /i and Tr for a general "Root's 
stopping time" that is for the first hitting time of some barrier R. We write T^^ 
for the first hitting time of a particular barrier . 

The proof is carried out through a hmit procedure. It is not hard to see that 
a barrier i?^, exists for a measure with finite support fi — X]"=i cy-i^xi ■ Rfj. is just 
of the form 

The points (6^) are not given expHcitly but only their existence is proven. One 
then needs some technical observations in order to pass to the limit. 

Let H denote the closed half plane, H = [0, +00] x [—00,00]. Root |103| 
defines a metric r on the space C of all closed subsets of H. Map H homeomor- 
phically to a bounded rectangle F by {t,x) ij^^ ^^^| ). Let F be endowed 
with the ordinary Euclidean metric p and denote by r the induced metric on H . 
For C,D eC put 

r(C, D) = maxjsup r{x, D), sup r{y, C)}. (7.3) 

x<£C y£D 

Equipped with r, C is a separable, compact metric space and 7?., the subspace 
of all barriers is closed in C and hence compact. Furthermore, this metric allows 
us to deal with convergence in probability of first hitting times of barriers. More 
precisely, the application which associates with a barrier R its first hitting time, 
i.e. R —> Tr, is uniformly continuous from (TZ, r) into the set of first hitting 
times of barriers equipped with convergence in probability. 

Lemma 7.3 (Root | 1U3| . Loynes [7U|). Let R be a barrier with corresponding 
stopping time Tn. IfV(Tii < 00) = 1, then for any e > there exists 6 > such 
that if Ri € TZ and r{R, Ri) < 6 then ¥{\Tji — Trj | > e) < e, where Tji-^ is 
defined via \7.1^ . If¥{Tji = 00) — 1 then for any K > there exists S > such 
that r{R, Ri) < S implies ¥{Tr^ < K) < e. 

If a sequence of barriers converges, Rn — - — > R with ]E7/i;„ < K < 00, then 
ETr<K and P(|rR,„ - Tr\ > e) > 0. 

With this lemma the theorem is proven taking a sequence /i„ of probability 
measures with finite supports converging to /x, =^ p. 

7.2. Some properties 

We start with some easy properties and then pass to the more involved results 
of Rost. The first property, established by Root |1(J3| for his stopping time, 
asserted the equivalence between integrability of _B|^ and T^, for k G N. We 
quoted it, in a more general form, in Preliminaries (Proposition!^^ as it is 
true for any stopping time T such that {BtAT : t > 0) is a uniformly integrable 
martingale. 
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Several further properties were described by Loynes fTUj, one of which is 
particularly important for us. In order to state it, we need to introduce the 
notion of a regular barrier. Let i? e 7?, be a barrier. Define 

= inf |y e [0, oo] : [0, +oo] x {y} C i?| 
a;^ = sup{y e [-00,0] : [0, +oo] x {y} C R}, (7.4) 

the first coordinates above and below zero respectively at which i? is a horizontal 
line. 

Definition 7.4. We say that a barrier R E TZ is regular if [0, +oo] x [x^, +oo] C 
R and [0, +oo] x [— oo, x^] C R. 

We say that two barriers i?i,i?2 are equivalent, Ri ~ R2, if x^^ — x^^ , 
x^^ — x^^ and the two barriers coincide on [0, +00] x [x^^ , x^^]. 

Thus a regular barrier R is one that is composed of horizontal lines above x^ 
and below x^. Clearly each barrier is equivalent to exactly one regular barrier. 
Furthermore, for a barrier R we have that Tr < T^r and so stopping times 
corresponding to two equivalent barriers i?i ^ R2 are equal Tr^ — Tr^ . This 
shows that we can limit ourselves to regular barriers. 

Proposition 7.5 (Loynes [70 ). Let fi be a centered probability measure on R 
with J x'^d^{x) — V < 00. Then there exists exactly one regular barrier R^, such 
that Btpi^ ^ and IETr^ = v. 

Loynes showed also that a wider class of probability measures /i can be em- 
bedded in Brownian motion through Root's construction. However, the unique- 
ness result does not carry over to this more general setup. 

The fundamental property of Root's construction is that it minimizes the 
variance of the stopping time. This conjecture, made by Kiefer |64| . was proved 
by Rost. 

Theorem 7.6 (Rost |lU7j). Let ^ be a centered probability measure on M 
with J x^d^{x) = V < 00 and let denote Root's stopping time embedding ji. 
Assume that E(T^)^ < 00. Then, for any stopping time S such that Bs ^ ^ 
andES^ v, we have: E{T^f <ES^ and E{T^)^ =ES^ if and only ifS^T^ 
a.s. 

More generally, for anyO<p<l, E{T^)p >ESP and for any p > I, E{T^)p < 
ESP. 

We will try to explain briefly, and at an intuitive level, how to arrive at 
the above result. Actually, Rost considers the more general setup of a Markov 
process (that is assumed transient for technical reasons but the extension is 
fairly simple). 

As in SectionO let [Xt : t > 0) be a Markov process (relative to its natural 
filtration {Tt)) on a compact metric space E associated with a semigroup (P^) 
and a potential kernel C/^ . Let v and /i be two probability measures on the Borel 
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sets of E with cr-finite potentials iiU''^ and lyJJ'^ . The measure ly is the initial 
law: Xq ~ V and /z is the law we want to embed. Recall the characterization 
of the balayage order given in Section IT^ For existence of an embedding, the 
measures have to satisfy /it/"^ < i/U^ . Then, there exists a stopping time T 
which embeds 

We are looking for a stopping time T with minimal variance. It is shown 
actually that it satisfies the stronger property of minimizing J^^j, f{Xs)ds 
for Borel, positive functions /, and any t > 0. This in turn is equivalent to 
minimizing vP^rpU^ for all t>Q. The idea is to look at the space-time process. 
We consider a measure iV on £' x R defined through 



We take its reduite N (with respect to the semigroup of the space-time process) 
and show that it is of the form 



with N, decreasing, right continuous and Nt — vP^^^^U for a certain stopping 
time with vPtp^ = It follows that Tr minimizes vP:^j,U'^ . The last step 
consists in showing that Tr is actually Root's stopping time for a certain barrier 
R. This is not surprising after all, as we saw that reduites are realized as hitting 
times of some sets and Root's stopping time is just a hitting time for the space- 
time process. 

7.3. The reversed barrier solution of Rost 

In this section we describe a solution proposed by Rost, which is a counterpart 
for Root's solution as it maximizes the variance of the stopping time. We have 
not found it published anywhere and we cite it following the notes by Meilijson 
|74| . The basic idea is to look at reversed barriers. Intuitively we just substitute 
the condition that if a point belongs to a barrier then all the points to the right 
of it also belong to the barrier, by a condition that all the points to the left of 
it also belong to the barrier. 

Definition 7.7. A subset p of [0, +oo] x [— oo, +oo\ is called a reversed barrier 





1. p is closed, 



2. (0, x) <E p for all x G [— oo, +oo], 



3. {t, ±oo) e p for all t e [0, +oo 



4- if {t,x) G p then {s,x) e p whenever s <t. 
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Theorem 7.8 (Rost). For any probability measure /i on R with /i({0}) = 0, 
J x^dfj,{x) — V < oo and J xd^(x) = 0, there exists a reversed barrier such 
that the stopping time 

Tp^ =inf{i>0: ep^, (7.5) 

solves the Skorokhod problem for fi, i.e. Bp^ ^ fi and lETp^ — v. Furthermore, 
if S is a stopping time such that Bg ^ fJ- and KS — v then for any p < 1 
ETP^ < ESP, and for any p > I ET'p^ > ESP. 

Note that there is a certain monotonicity in the construction of a reversed 
barrier, which is not found in the construction of a barrier. For a point x in the 
support of fi, say cc > 0, we must have the following property: there exists some 
tx such that {tx,x) e and {tx,y) (f. p^ for any y € [0, x). In other words, 
the reversed barrier p^, on the level x, has to extend "more to the right" than 
anywhere below, between and x. This, in a natural way, could lead us to revise 
the notion of a reversed barrier and introduce regular reversed barriers, which 
would satisfy Vx > if {t,x) £ p then {t,y) £ p, \/y £ [x,+go) and likewise 
for X < 0. We could then obtain uniqueness results for Rost's embedding in an 
analogy to Proposition !?. 51 above. We do not pursue these matters here, but we 
note that, according to our best knowledge, they have never been investigated. 

7.4- Examples 

Explicit Root's or Rost's solutions are very rarely known, still we can give some 
examples. First, a trivial example deals with Gaussian measures, li p = Af{0, v) 
then the barrier Ry = {{t,x) : t > v} yields a deterministic stopping time 
Tji^ = V, which naturally embeds the measure p. 

More sophisticated examples deal with square root boundaries. Shepp 
and Breiman [lOj . independently, considered the following stopping times*: 

aa^c = inf{t > : \Bt\ = cVt + a}. 

These are easily seen to be Rost's stopping times as aa.c = Tp^_^, where pa,c = 
{{t,x) e IR+ xR : x^ > c{t + a)} is a reversed barrier. As Bcr^^ = c^cja.c + a, 
to characterize the law of B^^ ^, it suffices to characterize the law of ctq c, which 
in turn is equivalent to giving its Laplace transform. This can be done using 
exponential martingales or through a reduction to the first hitting times of a 
given level for a radial Ornstein-Uhlenbeck process. We do not give the explicit 
formulae and refer to Going-Jaeschke and Yor 03] (see for example the formulae 
on page 322) for a recent and extensive study which includes the subject. 
Novikov [521 considered the following stopping time: 

Ta,6,c = inf{i > : St < -a + b^t + c}, (7.6) 

*More precisely, Breiman |in| worked with T = inf{t > 1 : > c\/t} conditionally on 
{Bi =0}. 
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where a > 0, c > and 6c^/^ < a. This is Root's stopping time for a barrier 
Ra,b,c = {(^jS;) e K+ X R : cc > — a + b^/t + c}, Tr^ ^ ^ = Ta,b,c- Again, charac- 
terizing the law of Br^ ^ ^ is equivalent to characterizing the law of Ta,b,c itself. 
To do so Novikov used a general family of exponential martingales of Brownian 
motion and the optional stopping theorem. In particular Novikov showed that 
E ^ ^ < cx) if and only if 6 > Z2v , where Zy is the largest root of the equation 
Dy{z) — 0, for Dy the parabolic cylinder function of the parameter v. 

The stopping times aa,c and Ta^b,c served Davis 26 to attain the optimal 
lower and upper bounds on the proportion E \Bt\^/^Tp/^, where T is a stop- 
ping time, ETP/^ < oo, p > 1. This is not surprising, as we would expect that 
to attain the bounds one would have to consider stopping times which, with 
distribution of Bt fixed, minimize and maximize the expectation ET^'/^, that 
is Root's and Rost's stopping times (cf. Theorems 17.61 and I7.8|) . 

Finally we mention two more papers which involve explicit calculations for 
Root's and Rost's stopping times. For a barrier R it is naturally possible that 
P(Tr = oo) >0. Robbins and Sigmund [11701 calculate the probabilities P(Th = 
oo) or P{Tb, > u), u fixed, for the barriers of the type R = {{t,x) : x > fit)}, 
where / is a function from a special family, which includes functions such as 
fit) — ct + aoT: f(t) = \J{t -\- e) \og{t + e). Lebedev [^21 has calculations, for an 
arbitrary d-dimensional diffusion, of the first moment of Rost's stopping times 
which are the first passage times through a time-dependent, smooth boundaries. 

7.5. Some generalizations 

The problem we want to solve is a certain generalization of the one solved 
by Rost. In jl()7| Rost showed that among all stopping times which solve a 
given Skorokhod embedding problem, the stopping time defined by Root has 
minimal second moment (which is equivalent to having minimal variance). In 
the Skorokhod embedding problem, the distribution oi Bt is fixed. Here we only 
fix two moments of this distribution and want to find the stopping time with 
the minimal variance. That is: 

Problem. Let us fix 4 > p > 1 and a constant Cp > 0. Among all stopping times 
T such that KBt — 0, ET = MB^ = 1 and M\Bt\^ = Cp, find the element Tmin 
with minimal variance. 

This problem is also linked with the optimal bounds on E | Bt | ^ / E T''/^ obtained 
by Davis (26j, which we mentioned in the previous section. Davis 26 gives a 
solution to our problem without the restriction E \Bt\p = Cp. 

Unfortunately, we did not succeed in solving this problem. We know it has 
a solution and we will present a certain conjecture about it. 

Proposition 7.9. The stopping time Tmin exists. 

Proof. Recall Definition 17.11 and (|7.1|) . It is an immediate observation that the 
stopping time we are looking for will be Root's stopping time for some barrier 
R. Indeed, if the /i ~ Bt,^^^ is the distribution of Bt^^^, then there exists a 
barrier i?^ such that Root's stopping time Tr^ yields -Btr^ ~ Moreover, 
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from Theorem [721 we know that ET^^ < ET^^,, and so ET^^ = ET^^ from 
the definition of Tmm- Therefore, using Theorem 17.61 again, we conclude that 

Tr^ = Tmin a.s. 

We can hmit ourselves to Root's stopping times. Let 



Ac^ = |m e : J xfj.{dx) = 0: y x^n{dx) = 1j y \x\P^^{dx) = Cp 

For every /i S Ac^ there exists a unique regular barrier such that -Bth^ ~ Z^- 
Let 7?.c be the subset of regular barriers corresponding to the measures in Ac ■ 
Let H denote the closed half plane and recall the definition H7.3|l of the 
metric r, under which C the space of all closed subsets of H, is a separable, 
compact metric space and TZ, the subspace of all barriers, is closed in C and 
hence compact. 

We can introduce an order relation in TZcj, given by: C ^ D 4=> ET^ < ET^, 
for C, D £ TZcp ■ If we prove that there exists Rmin - a minimal element of TZc^ 
relative to =^ - then, we will have Tmin = Tr^-^. In other words 

// there exists a Rmin G Ti-cp such that 

ETI . = inf ETL then Train =Tii^.^. 

To complete the proof we need to show that TZcp is closed under r. Indeed, 
choose a sequence of barriers i?„ G Ti-cp, such that ET|.^ \ infflg^^^ 1ET|.. We 
can suppose that i?„ converges (possibly choosing a subsequence) in r to some 
barrier R. To conclude that R = Rmin we need to know that R E TZcp, which 
follows if TZc is a closed subset of TZ. 

In order to prove that TZcp is a closed subset of TZ we essentially follow the 
ideas used by Root in the proof of Theorem 17.21 Take any sequence i?„ ^ R 
with Rn e TZcp- Then ETr^ = 1 for all n. It follows from Lemma f7. HI that for 
any e > 0, P{\Tr — TrJ > e) >0 and ETr < oo. Choosing a subsequence, 

n — ^oo 

we may assume that Tr^ — ^ Tr a.s. Furthermore, we can always construct a 
measure /i G Acp with J x^dfi{x) < oo, which implies ET^^^ < oo by Proposition 
im so that we may assume the sequence Tr^ is bounded in L^, and therefore 
uniformly integrable, hence converges in L^. In particular ETji = 1. Finally, 
Proposition 12 . II also grants the existence of a universal constant C4, such that 
EB^ < CiET'^, which implies that the sequence {Btp^J) is bounded in L"*. 
It converges almost surely, thanks to the continuity of paths and therefore it 
converges in . We see then, that EI^BtrI^ = Cp- This implies that R e TZcp 
and proves that TZcp is a closed set. We conclude that the stopping time Tmin 
exists and is equal a.s. to Tr for a certain barrier R. □ 

We do not know how to obtain Tmin explicitly. We do not even know if 
it is unique. Still, we want to give a simple observation, which motivates the 
conjecture below. We will let Cp change now, so we will write Tmin = T^ii,! = 
Tr^^ and vcp := ET^ . 
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Put c* = E|Bi|P, then the minimal stopping time is trivial T^f^^ = 1 and it is 
unique. It corresponds to the vertical barrier N = {{x,t) : t > 1}. Take any c,, > 
and define an order relation in the set TZcp hy: C < D iff r(C, N) < r{D, N). 
The minimal element R^i„ in TZr , relative to this order, exists as TZc is closed. 
Denote the stopping time corresponding to this barrier by T^^^ = Tj^n The 
above observation suggested us the following conjecture: 

Conjecture: The two minimal stopping times coincide a.s.: r^j„ = Tmin- 

This conjecture yields a manageable description of the stopping time - solu- 
tion to our problem. This comes from the fact that it can be translated somehow 
into information about the distribution of Brp-^p . However, we do not know how 
to prove it. We only have some ideas that we present now. 

Assume that 1 < p < 2. Consider / : [0, 1) [1, oo), f{cp) — Vcp- We will 
now show that / is a continuous function. We start with the right-continuity. Fix 
Cp < 1 and take a sequence decreasing to Cp. We have to show that for each 

subsequence 0"*= there exists a sub-subsequence Cp*"' such that v converges 

to Vcp- Take an arbitrary subsequence of Cp and denote it still c^. For simplicity 
of notation put c := Cp, v := w(cp), R = Rc^, Cn = Cp and f„ = Vc^. It is an 
easy observation that Vn cannot converge to anything smaller than v as such 
a situation would yield a stopping time with smaller variance than T^^^. So it 
is enough to find a subsequence of barriers i?„j^ converging to R, Rn^. € 7?.c„ , 
such that the corresponding stopping times Tr^^ will converge in L"^. 

Let n be the distribution of Bt'^ ■ Define a family of probability measures 

by 

g 

fis = 2(^{-i} + + (1 - ■^)M> for < s < 1. 

Observe that J xfis{dx) = 0, / x'^^s{dx) = 1 and J \x\P^s{dx) = c + s(l — c) > c. 
Corresponding to our sequence of (c„) we have a decreasing sequence (s„), s„ \ 
0, / Ixl^ ^s„{dx) = Cn- Let Rn be the sequence of barriers such that Bt^^ ~ A^s„- 
Note that Rn E TZc„ ■ Wc need also to point out that / x^iis{dx) < J x^fi{dx) < 
00 (this implies that the family Tr^ is uniformly bounded in L^). Now wc can 
choose a converging subsequence Rnk and the corresponding stopping times 
converge with ET^ —> v. 

To prove the left-continuity of / we follow the same reasoning, we just have 
to define the appropriate measures which will play the role of ■ Fix a constant 
a > 1 so that -^^^—^ < c. Define a measure v ^ [1 — -^)5{o} + 2a^('^{-a} + ^{o})) 
which plays the role of + ^{i}) above. We defined the measure so as to 

have: / xu{dx) = 0, / x'^v{dx) = 1 and / \xY'v{dx) = ^5^- Now, for < s < 1, 
we take r}s = sv + {1 — s)^, where n Bt-^ . ■ It follows that J xr]s{dx) = 0, 
J x'^ris{dx) = 1 and / \x\Pris{dx) = c + s{-^:^ - c) < c. 

It is an easy observation that /(c) > 1 and /(c) = 1 if and only if the 
variance of the corresponding stopping time T^i„ is 0, that is if and only if 
'^min = * a.s. for some t > 0. Then, since ET^-^^^ = 1, wc have T^^^n = 1 a.s. and 
so c = E|i?i|P. In other words, / has one global minimum. We believe that / is 
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a convex function. 

8. Shifted expectations and uniform integrability 

So far, throughout this survey, we have considered the case when Ufi < Uv, 
where v is the initial law and jj, is the measure we want to embed (recall Def- 
inition |^] which gives Ufx). A natural question to ask is: what about other 
probability measures? Can they be embedded into a continuous local martin- 
gale {Mt : t >{)) as well? The answer is yes. But we have to pay for this - the 
stopped martingale will no longer be uniformly integrable. 

If (MtAT : i > 0) is uniformly integrable, T < oo a.s., then EA/q = EMt- 
Suppose both v and /i have finite expectations but / xdv{x) ^ / xd^{x). This 
corresponds to the simplest situation when we do not have the inequality be- 
tween the two potentials: U ji ^ Uv. Then, if T embeds /x, the martingale 
{MtAT '■ t > 0) cannot be uniformly integrable, as KMq ^ EAf^- The same 
applies to the situation when / |a;|dz^(x) < oo and / \x\dyL(x) = oo. 

It is very easy however to point out a number of possible stopping rules 
which embed ^, in the case of / \x\d[i(x) < oo. It suffices to note that our usual 
assumption: (M, M)oo = oo a.s., implies that Ta = mi{t > : Mt = a} is finite 
a.s. for any a G M. We can therefore first wait till M hits m — J xdiJL{x) and 
then use any of the embeddings described so far applied to the shifted process 
starting at m. 

We have to see therefore two more properties. Firstly, that any probability 
measure with J |a;|(i/i(2;) = oo can be embedded in M, and secondly that in case 
of / \x\d^{x) < oo but U/i ^Uv we necessarily loose the uniform integrability. 
Resolution of our first problem is trivial and was proposed by Doob as noted on 
page 02] (see also exercise VI. 5. 7 in Revuz and Yor jM])- A different solution 
is found in Cox and Hobson |23| who generalized the Perkins stopping rule in 
such a way that it works with any distribution on R. The construction itself is 
rather technical and we will not present it here. We refer to Section 1!?. 201 and to 
the original paper for details. 

The second question asked above is also simple^. Suppose that Mq ~ i' and 
Mt ~ H- Then for any a; € M the process {Mt^T — x : t > 0) is a martingale and 
therefore (|MtAT — x\ : t > 0) is a submartingale. In particular the expectations 
of the latter increase and therefore —Uiy{x) = E | Mq — a;| < E | A/^— 2;| — —U^i{x) 
for any x € R, or equivalently Ufi < Uv. We have thus proved the following 
proposition: 

Proposition 8.1. Let [Mt : t > 0) be a continuous, local, real-valued martingale 
with {M, M)oo = oo a.s. and Mq ^ v . For any probability measure /i on R there 
exists a stopping time in the natural filtration of M which embeds fi, that is 
Mt ~ /i. Furthermore, the stopping time can be taken so that {Mt^r : ^ > 0) is 
a uniformly integrable martingale if and only if U fx < Uv. 

^Strangely enough we found it treated only in notes by Meilijson 1741 sent to us by the 
author together with remarks on the preliminary version of this work. 
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A natural question therefore is how to express the idea that the stopping 
time should be "small", if we cannot use the uniform integrability condition? 
This brings us to the question of various types, or properties, of stopping times. 
We gather and discuss them now. 

Definition 8.2. A stopping time T is called minimal if for any stopping time 
S <T, Ms Mt implies S^T a.s. (cf Monroe Cox and Hobson J^). 
A stopping time T is called standard if there exists a sequence of bounded 
stopping times Tn, with limTn — T a.s. and \imU C{Mt„)(x) = UC{Mt){x) > 
—oo for all X gM. (cf. Chacon (1 7^ , Chacon and Ghoussoub '18't, Falkner f,%y ). 
A stopping time T is called ultimate if for any probability measure p with 
Up > Ufi, where Mt ~ /i, there exists a stopping time S < T such that Ms ^ p 
(cf. Meilijson ^gj, van der Vecht \MM). 

For T a standard stopping time we always have EA/t — (as — oo < 
UC{Mt) < U6q), while minimal stopping times exist in a broader context and 
they seem fundamental for the extension of the Skorokhod embedding to non- 
centered cases. The following proposition explains that the minimality concept 
also applies in the centered case: 

Proposition 8.3. Let T be a stopping time such that EMt = 0. The following 
are equivalent 

1. T is minimal, 

2. T is standard, 

3. {MtAT '■ t > 0) is a uniformly integrable martingale. 

The equivalence between 1. and 3. was first obtained by Monroe [ZHIj who 
made an extensive use of the theory of barriers (see Section0l. It was then argued 
in a much simpler way by Chacon and Ghoussoub ^Hl- They showed, using one 
dimensional potential theory, that a minimal stopping time, for centered final 
distribution, is the same as a standard stopping time (this was also shown by 
Falkner PE])- Therefore one can deduce properties of minimal stopping times 
from those of standard stopping times, which in turn are easy to establish (see 
Chacon [HI). 

An immediate conclusion from the above Proposition, is that any solution 
to the Skorokhod embedding problem in the classical setup is minimal. It seems 
that in the non-centered setup minimality is the right condition to impose on 
the stopping times (Cox and Hobson (23). The following proposition assures 
that it is also a plausible condition: 

Proposition 8.4 (Monroe [78" ). For any stopping time S there exists a min- 
imal stopping time T < S, with Ms ^ Mt- 

We end this section with a result about ultimate stopping times. Namely, we 
treat the question: is it possible for a stopping time to be standard and ultimate 
at the same time? A negative answer is given by the following 
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Proposition 8.5 (Meilijson |72|. van der Vecht [121"). If T is a standard 
and ultimate stopping time and fi ^ Bt then /i({a, 5}) = 1, for some a < < b. 

The proof consists in showing that such a solution would have to coincide 
with both the standard Azema and Yor solution and its reversed version (see 
Sectional, which in turn are shown to coincide only in trivial cases. 

9. Working with real- valued diffusions 

As the reader may have have observed, the Chacon and Walsh methodology 
does not rely on any special property of Brownian motion. We underlined this 
in Section 13.81 and in Section 15.11 while proving the Azema- Yor construction. 
There is only one crucial property that was used: the way potential changes 
if we stop our process at the exit time of [a, 5], which is described in (vii) of 
Proposition 12.31 This is also the only property used by Zaremba |126j in his 
proof of the Azema- Yor solution. 

This property stays true however, for any real-valued diffusion on natural 
scale. Therefore, generalizing this framework to real-valued diffusions with con- 
tinuous, strictly increasing scale functions is easy. Hall (see Section I3.4|l 
treated Brownian motion with drift. Recurrent diffusions were treated in the 
original work of Azema and Yor and the general case was considered by 
Pedersen and Peskir ^^l- We present here a complete description following 
mainly a recent paper by Cox and Hobson [23 • We note that more general, 
not necessarily continuous processes, were treated as well. In Section 13.151 we 
presented the solution developed by Bertoin and Le Jan and in Section l!j.21l 
we discussed the solution for the age process of Brownian excursions developed 
by Obloj and Yor 

Let {Xt)t>o be a regular (time-homogeneous) diffusion taking values in an 
interval / C M, Xq — a.s. Then, there exists a continuous, strictly increas- 
ing scale function s : I ^ M. and Mt = s{Xt) is a local martingale on s(/). 
Furthermore, we can choose s(0) = (see for example Revuz and Yor p. 
301). 

We transform the embedding problem for X into an embedding problem for 
M. Let fj, he a probability measure on /, which we want to embed in X. Define 
a measure /i" through 

fi^iB)=fi{s-\B)), BeBis{I)), 

that is if a random variable ^ ^ ^J. then s(^) ~ /i*. It is easily seen that if we 
find a stopping time T such that Mt ^ fJ^'' then Xt ^ and the embedding 
problem for X will be solved. 

At first glance one would like to conclude applying our previous results to M . 
However some care is needed as s{I) might be bounded and the local martingale 
M will not be recurrent. 

We assume that J\x\d^^{x) — J |s(u)|d/i(u) < oo and write m = Jxdfi^{x) = 
J s{u)djj.{u). There are three distinct cases: s(/)° = M, s{I)° = (—00,0;) or 
s(/)° — {a, 00) and s{I)° — (a, (3), which we now discuss. 
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1. s(I)° = R. This is a trivial situation since the local martingale (Mf : t > 0) 
is recurrent and we can use any standard construction, such as the Azema- 
Yor solution (see Section [S]), to embed /x'* in M. 

2. s(I)° = (— oo,q;). In this case (Mf : t > 0) is a local martingale that 
is bounded from above by a. Suppose T is a stopping time such that 
Mt ^ /^^ and let 7„ by a localizing sequence for M. Then, by Fatou 
lemma, 

(-m) = E{~Mt) = E{ lim -MtAjJ < liminf E(-Mta7„) = 

and so m > is a necessary condition for the existence of an embedding. 
It is also sufficient: note that m < a from the definition of fi^, and the 
stopping time Tm = inf{t > : Mt = m} is a.s. finite for m > 0. We 
can put T = r,„ + Tay ° ^t,„ , where Tay is defined through H5.3|l for 
the measure /i*' shifted to be centered. The stopping time T embeds /i*, 
Mt ^ fi'^. Of course, we could also use some different embedding in place 
of Tay. 

s(I)° = (a, oo). A completely analogous reasoning to the one above gives 
that m < is a necessary and sufficient condition for the existence of a 
solution to the embedding problem for ^. 

3. s(I)° = {a, (5). In this case we have a bounded martingale and we can use 
the dominated convergence theorem to see that m = E Mt = lim E Mta.j„ = 
0. It is of course also a sufficient condition as we can then use any standard 
embedding, such as the Azema-Yor solution (see Section EJ. 

We point out that the simplifying assumption we made in the first place (namely 
/ \s{u)\dij,{u) < oo) is not necessary. One can see that in the first case above a 
solution always exists and in two last cases the assumption is actually a necessary 
condition. We refer to Cox and Hobson 23 for more details. 

10. Links with optimal stopping theory 

There are two main ways to establish a link between an embedding problem 
and an optimal stopping problem. Firstly, one can prove that a solution to 
the optimal stopping problem is given by a certain solution to the embedding 
problem (for some law). Secondly, one can look for such an optimal stopping 
problem that its solution solves a given embedding problem. The former is 
discussed in Section Hi). II and the latter in Section [lU.2l 

Closely connected with these subjects are also the works which followed and 
generalized the work of Perkins in studying the interplay of laws of the supre- 
mum, the infimum and the terminal law of a martingale. The main literature in 
this direction is discussed in Section [10.31 

In this section we only give a brief overview of the above-mentioned topics, 
with no claim at describing that vast domain in a complete way. In particular 
we do not treat optimal stopping theory itself. For a good and brief account of 
its basic ideas as well as some new developments, see chapter 1 in Pedersen (HRlj . 
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10.1. The maximality principle 

Let (/) be a non-negative, increasing, continuous function and c a continuous, pos- 
itive function. Consider the following optimal stopping problem of maximizing 







0(5,) - / c{Bs)ds , (10.1) 



over all integrable stopping times t such that 



E 







c{Bs)ds < oo. (10.2) 



Suppose, in the first instance, that (/)(a;) = x and c{x) = c > is constant. 
In this formulation the problem was solved by Dubins and Schwarz |33| in an 
article on Doob-like inequalities. The optimal stopping time is just the Azema- 
Yor embedding for a shifted (to be centered) exponential distribution (with 
parameter 2c). 

The methodology allowing to deal with problems like IjlO.lll was presented 
by Dubins, Shepp and Shiryaev [34j who worked with Bessel processes and 
4>{x) = x, c{x) — c > 0. The setup of 4>{x) — x and c{x) a non-negative, 
continuous function, was then treated by Peskir in a series of articles ([HJi !^2| . 

IMJ. 

Theorem 10.1 (Peskir [91). The problem ilO.l]) . for ip{x) — x and c{x) a 
positive, continuous function, has an optimal solution with finite payoff if and 
only if there exists a maximal solution of 

2c(ff(s))(s-5(s)) 

which stays strictly below the diagonal in IS?, i.e. g*(s) < s for all s G K. The 
Azema-Yor stopping time = ini{t > . Bt < (?*(S't)} is then optimal and 
satisfies J^IO.S}^) whenever there exists a stopping time which satisfies ^10.S\) . 

Actually the above theorem was proven in the setup of any real, time- 
homogeneous diffusion to which we will come back later, as it allows us to 
recover the solution for a general as a corollary. The characterization of exis- 
tence of a solution to H10.1|) through existence of a solution to the differential 
equation H10.3|l is called the maximality principle. We note that Dubins, Shepp 
and Shiryaev '34' had a different characterization of g■^, through the condition 
M£) .1. " 

^ t— >oo 

Let now be any non-negative, non-decreasing, right-continuous function 
such that (f>{Bt) — ct is a.s. negative on some (io, oo) (with random) and keep 
c constant. This optimal stopping problem was solved by Meilijson ftj> . Define 

H(x) — supE (/)(a; + Sr) — ct 
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Theorem 10.2 (Meilijson |7S])- Suppose thatEswpt{(j){Bt) - ct} < oo. Then 
H is absolutely continuous and is the minimal solution of the differential equa- 
tion 

H{x)-j-{H'{x)r^^{x). (10.4) 

If (f) is constant on [xq, oo) then H is the unique solution of {10.4)) ^hat equals (j> 
on [xo,oo). The optimal stopping time which yields H(0) is the Azema-Yor 

stopping time given by = inf{i > : Bt < St — ^^-4§^}- 

Let us examine in more detail the result of Meilijson in order to compare 
it with the result of Peskir. The Azema-Yor stopping time is defined as = 
inf{< > : i?f < g{St)} with g{x) ~ x — ^ ■ Let us determine the differential 
equation satisfied by g. Note that H is by definition non-decreasing, so we have 
H'{x) = ^JAc(H(x) - (j){x)). Differentiating once again we obtain 



H"{x) 

Therefore 



2c{H'{x) - (p'{x)) _ 2c{H'{x) - (j>'{x)) 



' 2c R'(x) 2c{x-g{x)) ^ ' 

We recognize immediately the equation (|1U.3|I only there (/)'(s) = 1 and c was 
a function and not a constant. This motivated our investigation of the general 
problem llO.il which is given below. It is given without proof, as it requires a 
fair amount of new notation and more involved ideas. The interested reader is 
referred to [HH]- The following theorem is obtained from Theorem 3.1 in Peskir 

m 

Theorem 10.3. The problem \1U.1]) for a continuously differentiable function 
<f) and c positive, with a countable number of points of discontinuity, has an 
optimal solution with finite payoff if and only if there exists a maximal solution 
9* of 

9'is) = , . , , ... (10.6) 

^c[g{s))[s- g{s))) 

which stays strictly below the diagonal in , i.e. g*(s) < s for all s CzR. In this 
case the payoff is given by 

.0-1(0) 

14 = 0(0) _ 2 / uc{u)du, (10.7) 

where gj is a function given explicitly in \EM- "^^^ stopping time = inf{<: > 
: < 5*(*S't)} is then optimal whenever it satisfies otherwise it is 

"approximately" optimal ^'^ . 

Furthermore if there exists a solution p of the optimal stopping problem 1^10.1]) 
^"In the sense of Peskir 1911 . 
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then P(r* < p) = 1 and satisfies ^U.'Jjj . 

If there is no maximal solution of ^10. 6]) . which stays strictly below the diagonal 
in R^, then = oo and there is no optimal stopping time. 

We suppose the function (/) is strictly increasing and with a continuous deriva- 
tive. The assumptions made by MeiUjson are weaker. All he assumes is that 
(j) is non-decreasing and right-continuous. In particular he also treats the case 
of piece-wise constant (f>. The reasoning in this case is based on an iteration 
procedure. A similar argument can be used here to describe the stopping time 
and the payoff. We refer again to [HS| for the details. 

10.2. The optimal Skorokhod embedding problem 

Consider now the converse problem. That is, given a centered probability mea- 
sure fi describe all pairs of functions ((/>, c) such that the optimal stopping time 
T* which solves H10.1|l exists, and embeds /x, that is Br, ~ /x. 

This may be seen as the "optimal Skorokhod embedding problem" (Peskir 
|92| 1 as we not only specify a method to obtain an embedding for /i but also 
construct an optimal stopping problem, of which this embedding is a solution. 

Again, consider first two special cases. First, let ^(a;) = x. From the The- 
orem 110.11 we know that r* is the Azema-Yor stopping time and the function 

is just the inverse of barycentre function of some measure fi. The problem 
therefore consists in identifying the dependence between p, and c. Suppose for 
simplicity that /i has a strictly positive density / and that it satisfies the L log L- 
integrability condition. 

Theorem 10.4 (Peskir |92p. In the above setup there exists a unique function 

such that the optimal solution t^^ of ^0.1\l embeds ji. This optimal solution is 
then the Azema-Yor stopping time given by ^5.S\) . 

Now let c{x) = c be constant. Then we have 

Theorem 10.5 (Meilijson j75[ ^. In the above setup, there exists a unique 
function (f) defined through \1U.4}) with H'{x) = 2c{x — "^''^^{x)), where is the 
barycentre function given in such that the optimal solution of 1^10.1}} 

embeds ^. This optimal solution is then the Azema-Yor stopping time given by 

roi) . 

To solve the general problem we identify all the pairs {ip, c) such that the 
optimal stopping time which solves (110. 1|) embeds p. in B, B^ ~ /i. This 
is actually quite simple. We know that for t* = mi{t > Bt < g*{St)} we 
have Br, ^ /i if and only if g*{s) — ^'^^(s). This is a direct consequence of 
the Azema-Yor embedding (cf. Section . Let us investigate the differential 
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equation satisfied by 4"^, For simplicity assume that /i has a strictly positive 
density /. It is easy to see that 

Comparing this with the differential equation for (110.6(1 we see that we need 
to have 



(10.10) 



Equivalently, we have 



2ciu) f{uy 



for u e supp{^). (10.11) 



If we take (j){s) = s, so that 0'(s) — 1, we obtain c(u) = 2-p{u) ' ^ ^^^^ '-'^ ^'^^ 
so-called hazard function, which is the result of Theorem 110.41 In general we 
have the following theorem 

Theorem 10.6. Let fi be a probability measure on R with a strictly positive 
density f, which satisfies the L log L-integrability condition. Then the optimal 
stopping time which solves H1U.1\) . embeds fi in B, i.e. Br, fJ,, if and only if 
(0, c) satisfies ^lU.ll]) for all u> 0. The stopping time r* is then the Azema-Yor 
stopping time given by ]5.S\) . 

1 0. 3. Investigating the law of the maximum 

Several authors investigated the laws of the maximum and the minimum of 
stopped processes and their dependence on the terminal and initial laws. We 
will not discuss here all the results as there are too many of them and they 
vary considerably in methods and formulations. We give instead, hopefully a 
quite complete, list of references together with a short description, and we leave 
further research to the reader. 

• Dubins - Schwarz - this article focuses more on Doob-like inequal- 
ities but some useful descriptions and calculations for Brownian motion 
stopping times are provided. Formulae for constant gap stopping times are 
established (see Section ^lU.!]) . 

• Kertz and Rosier 61 - these authors build a martingale with assigned 
terminal and supremum distributions. More precisely, consider a probabil- 
ity measure /i on R with J \x\dp.{x) < oo and any other measure v with 
[i < V < [1* , where p,* is the image of fi through the Hardy- Littlewood max- 
imal function given by 1^5.!^} . Authors show that there exists a martingale 
{Xt)o<t<i such that Xi ^ fi and Si ^ v. It's worth mentioning that the 
converse problem, to prove that if v ^ Si then fi < ly < p* , was solved 
much earlier in Dubins and Gilat ^32t (see also Section \5.5\) . 
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Jacka [S^I - this work is concerned with Dooh-like inequalities. The case of 
reflected Brownian motion and its supremum process is treated. Actually, 
a kind of maximality principle (see Section \1U.1\) in this special case is 
established. 

Rogers |1L)2| - a characterization of possible joint laws of the maximum 
and the terminal value of a uniformly integrable martingale ( or convergent 
continuous local martingale vanishing at zero) is developed. A number of 
previous results from the previous three articles on this list can be deduced. 

Vallois I12()| - the author characterized the set of possible laws of the 
maximum of a continuous, uniformly integrable martingale. As a corollary 
he obtained also a characterization of the set of possible laws of the (ter- 
minal value of) local time at 0. Vallois '1201 also developed a solution to a 
particular version of the Skorokhod embedding problem, which we described 
in Section \H.lfA In lll^f . independently of Rogers '102], Vallois also de- 
scribed the set of possible joint laws of the maximum and the terminal 
value of a continuous, uniformly integrable martingale. 

Hobson - for fixed initial and terminal laws /j,o and fii respectively, 
the author shows that there exists an upper bound, with respect to stochas- 
tic ordering, on the law of Si, where S is the supremum process of a 
martingale {Xt)o<t<i with Xq /iq and Xi ^ fii. The bound is cal- 
culated explicitely and the martingale such that the bound is attained is 
constructed. The methods used are closely connected with those of Chacon 
and Walsh ( see Section \S. iSj) and therefore with our presentation of the 
Azema-Yor solution (see Section\^). 

Pedersen |86| - this Ph.D. thesis contains several chapters dealing with 
optional stopping and the Skorokhod problem. In particular, Doob 's maxi- 
mal LP -inequalities for Bessel processes are discussed. 

Peskir |92j - these are the works discussed above in Sections \10.1\ 

andUEE 

Brown, Hobson and Rogers |13| - this paper is very similar in spirit 
to l^y. The authors investigate an upper bound on the law o/supg<(<2 -M* 
with given Mq = 0, Mi ~ fj,i and M2 ^ IJ-2, where {Mt)o<t<2 is a martin- 
gale. A martingale for which this upper bound is attained is constructed 
using excursion theory. 

Hobson and Pedersen [SS] - in this paper the authors find a lower 
bound on the law of the supremum of a continuous martingale with fixed 
initial and terminal distributions. An explicit construction, involving the 
Skorokhod embedding, is given. The results are applied to the robust hedging 
of a forward start digital option. 

Pedersen |87| - the author solves the optimal stopping problem of stopping 
the Brownian path as close as possible to its unknown ultimate maximum 
over a finite time interval. 
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• Cox and Hobson [23] - see Section FA'. 2(A 
11. Some applications 

The Skorokhod embedding problem is remarkable as it exists in the literature 
for over 40 years by now, but it still comes back in various forms, it poses new 
challenges and is used for new applications. A thorough account of applications 
of this problem would require at least as much space as we have taken so far to 
describe the problem itself and its solutions. With some regret, this is a project 
we shall postpone for the moment . What we will do instead is try to sketch 
the main historical applications and give the the relevant references. 

11.1. Embedding processes 

So far we have considered the problem of embedding, by means of a stopping 
time, a given distribution in a given process. Naturally, once this had been done, 
various applications were being considered. Primarily a question of embedding 
a whole process into, say, Brownian motion, was of great interest. This led to 
the paper of Monroe IHOIj where he showed that all semimartingales are in fact 
time-changed Brownian motions. We will not present here a full account of the 
domain but rather try to sketch the important steps. 

A very basic question consists in representing (in law) a martingale (X„ : n S 
N) as {Bt„ : n G N), where {Bt : i > 0) is a Brownian motion and (T„ : n € N) 
is an increasing family of stopping times. This question goes back to Harris |52j 
and it actually motivated both Skorokhod |115| and Dubins [3T]. The question 
is in fact solved by any solution to the Skorokhod embedding problem. Indeed, 
let {Xn : n e N) be a martingale, Xq = a.s, and be some solution to the 
Skorokhod embedding problem. The idea is then to build consecutive stopping 
times r„ to embed in the new starting Brownian motion (_Bt'„_i+s — St„_i)s>o 
the conditional distribution of (X„+i — given Xi = Bti , ■ ■ ■ Xn-i — Bt,^_i. 

Formal notation is easier if we use the Wiener space. Let uj{t) be the coor- 
dinate process in the space of real, continuous functions, zero at zero, equipped 
with the Wiener measure, so that {uj{t))t>o is a Brownian motion. Fix an em- 
bedding method, which associates to a measure ^ a stopping time which 
embeds it into to: uj(Tfj_{uj)) ~ /i and uj{T^(uj) At:t>0)isa uniformly inte- 
grable martingale. Write /io for the law of Xi and /i„(a;i, . . . , x„) for the regular 
conditional distribution of (X„_|_i — X„) given Xk = Xk, 1 < k < n. Let Ti — T^^ 
and 

T„H =T„_iH+r^„(b,,...,b„)(a;;), (11.1) 

where bi = uj(Ti(uj)) and w^(s) = uj{s + T„_i(a;)) — Lu(Tn-i{Lu)). It is then 
straightforward to see that the following proposition holds. 

Proposition 11.1 (Skorokhod [m| p.l80, Dubins [31]). Let (X-a : n = 

0,1,...) be a real valued martingale with Xq — a.s. and define an increasing 

Likewise, we believe it is very instructive to see explicit forms of various solutions discussed 
in Section|3]for some simple probability measure fi. We plan to develop this idea in a separate 
note. 
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family of stopping times (Tn) through Then uj(Tn{uj))n> has the same 

distribution as {Xn)n>i- Furthermore i/EX,^ = w„ < cxd for all n > 1 then 
E T„ — Vn for all n > 1. 

The next step consisted in extending this resuh to right-continuous mar- 
tingales with continuous time parameter. It was done by Monroe |78| in the 
same paper, where he cstabhshed the properties of minimal stopping times (see 
Section IS)). Finally, as mentioned above, in the subsequent paper Monroe 80j 
established that any semimartingale is a time-changed Brownian motion. Mon- 
roe used randomized stopping times and they were obtained as a limit, so that 
no explicit formulae were given. Zaremba jl25j tried to prove Monroe's results 
using only natural stopping times, but his proof is known to be false. Some 
years later Scott and Huggins (lllj . developed an embedding for reverse mar- 
tingales and established for them, as an application, a functional law of iterated 
logarithm. 

We mention two explicit embedding of processes in Brownian motion by 
Khoshnevisan and Madan and Yor 71 . The former is an embedding of a 
compound Poisson process. The basic idea is as follows. First, we realize jumps 
independently as a sequence of i.i.d. random variables {Vi)°^i. Then we define 
stopping times Ti as consecutive excursions, of height Vi, of Brownian motion B 
away from supremum, that is Tq = 0, T^+i = inf{t > Ti : sup2n.<,j<f i?„ — Bt = 
ViJ^i}. Compound Poisson process is obtained via time-changing the process B, 
so that it is linear, with given slope, in-between the jump times. Khoshnevisan 
imposes some conditions on the law of jumps. He then obtains Strassen's rate 
of convergence (as in below), for renormalizcd compound Poisson process 

to Brownian motion, only in this case uniformly in time on compact intervals. 
Furthermore, he is also able to prove convergence of the number of crossings to 
Brownian local times'^^, uniformly in space and in time (on compact intervals). 

Another explicit embedding of processes in Brownian motion can be found in 
Madan and Yor [7J . The authors provide methods of constructing martingales 
with prescribed marginals. One of their techniques consists in using the Azema- 
Yor solution. More precisely, if {nt)t>o is a family of zero-mean marginals they 
define their martingale through Mt = Bxt , with Tt = mf{u > : S'„ > 'i'^^ (Bu)} 
where ^'^^ is the barycentre function given by if^ . Naturally this gives the 
desired solution only if the family of functions 5"^^ is increasing in t. The authors 
note that this property is equivalent to the statement that the measures /it are 
increasing in time in the mean residual life order (Shaked and Shantikumar 

[n2i,p.43). 

We have considered so far embeddings in law. However, sometimes stronger 
a.s. representations were needed for applications. We mention a series of works 
by Knight (|SS1 EZl)- In his first paper, which originated from his Ph.D. 
dissertation. Knight [SS] developed a construction of Brownian motion as a 
limit (uniform in finite time intervals a.s.) of random walks. He also mentioned 
the converse problem, i.e. the Skorokhod embedding problem, but without using 
this name. Knight had a good control over the rate of convergence for his limit 

-^^The rate of convergence is seen to be n-'^/^(log n)'^''*"'^ , for any e > 0. 
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procedure and the construction generalizes easily to arbitrary diffusions. Knight 
then used this construction to establish the celebrated Ray-Knight theorem 
about the local time as Markov process in space parameter. More recently, 
Knight inZ] used again the construction to describe the law of local time of a 
stopped Brownian motion, given subsequent random walk approximations. This 
study allowed Knight |67| to investigate the law of Brownian motion given its 
local time. 



11.2. Invariance principles 

In Skorokhod's original work |115) the embedding problem was introduced as 
a tool to prove some invariance principles (and their approximation rates) for 
random walks. This was continued afterwards and extended. 

We start with a simple reasoning (Sawyer Meilijson jJJ) which shows 

how to obtain the central limit theorem for i.i.d. variables from the weak law 
of large numbers. Let /i be a centered probability measure with finite second 
moment, J^x'^dfi^x) ~ v < oo, and let T be some solution that embeds /i in B, 
ET = w. Write Ti =T and let T2 be the stopping time T for the new Brownian 
motion Pt = Bt+Ti — Bt^ . Apply this again to obtain T3 etc. We have thus a 
sequence Ti, T2, . . . of i.i.d. random variables, and Sn — t ^ ^'^'^ '^^ 

i.i.d. variables with law fi. Define the process -Bj""* = -^Bnt, which is again a 

Brownian motion, thanks to the scaling property. The pairs [B,B^"-'^) converge 
in law to a pair of two independent Brownian motions {B,f3). The stopping 
times Ti are measurable with respect to the Brownian motion B and, by the 
strong law of large numbers, — y^T^i '^i * ^ ^-^^ This implies that B^^^l-^^ 

converges in law to ~ A/'(0, v), which ends the argument as i?^"l-^„ = 
This reasoning was generalized into: 

Proposition 11.2 (Holewijn and Meilijson jSGp . Let X ~ (Xi : i > 1) be a 

stationary and ergodic process such that EXi = and E(X„|Xi, . . . , X„_i) = 
a.s., n — 2,3,... and (Bt : t > 0) a Brownian motion. Then there exists a 
sequence of standard (randomized) stopping times < Ti < T2 < . . . , such that 

• (Bti , Bt2 , ■ ■ ■) is distributed as X , 

• The process of pairs ( (-Bti , T'l ) , {Bt2 ~ iT2~Ti), . . .) is stationary and 
ergodic, 

• ETi Var{Xi). 

Invariance principle is a generic name for a limit behavior of functionals of 
sums of random variables which proves independent of the distributions of the 
variables. The most known cases include the Central Limit Theorem, the Law 
of Iterated Logarithm and Donsker's theorem. All of these can be obtained via 
the Skorokhod embedding techniques, combined with appropriate properties of 
Brownian motion. We presented a simple example above. An advantage of this 
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approach is that one obtains the desired Hmit behavior together with bounds 
on the rate of approximation. A survey on the subject was written by Sawyer 
|11U| just before the Komlos-Major-Tusnady presented their construction, 
to which we come back below. 

The general setup for invariance principles can be summarized in the follow- 
ing way: construct a sequence of independent random vectors X„, with given 
distributions, and a sequence of independent Gaussian vectors ¥„, each y„ hav- 
ing the same covariance structure as X„, so as to minimize 



A(X,Y):= max iVx, -VfJ. (11.2) 

l<fe<ri K ^ / ^ I 



1=1 



The rate of approximation for the invariance principle expresses how small A is 
as n ^ oo. 

The classical case deals with real i.i.d. variables. Assume X„ are i.i.d., ¥,Xi = 
0, EXf = 1 and EXf = & < cx). To use the Skorokhod embedding to evaluate 
A{X,Y) it suffices to apply Proposition ll 1 . II to the martingale Mk = X]i=i -^k- 
Since the martingale increments are i.i.d. so are the consecutive stopping times' 
differences: {Th+i — Tk), k > 0, where Tq = 0. The Gaussian variables are given 
in a natural way by Yk = Bk — Bk-i, k > 1. The differences in (jll.2|l reduce to 
\Bt^, — Bk\. In his paper Strassen jllfij . proved that 

V T \^T„ - Bn\ /I 1 o^ 

^ = hmsup- — z — z TTT^T, (ll-o) 

„^oo (nloglogn)i/4(logn)i/2' ^ ' 

is of order 0(1). Strassen used the original Skorokhod's randomized embed- 
ding, but the result does not depend on this particular construction of stopping 
times. This subject was further developed by Kiefer who proved that the 
rate ((logn)^nloglogn)^/^ could not be improved under any additional mo- 
ment conditions on the distribution of Xi. The case of independent, but not 
i.i.d. variables was also treated. 

The Skorokhod embedding method was regarded as the best one in prov- 
ing the rates of approximation in invariance principles for some time. However, 
in the middle of the seventies, a new method mentioned above, known as the 
Komlos-Major-Tusnady construction, was developed (see [2HI and [HHli inHj gives 
some extensions) that proved far better than the Skorokhod embedding and pro- 
vided an exhaustive solution to minimizing (|11.2|l . The reason why Skorokhod 
type embedding could not be optimal is quite clear. It was created as a means 
to embed a single distribution in Brownian motion and to embed a sequence 
one just iterates the procedure, while the Komlos-Major-Tusnady construction 
was created from the beginning as an embedding of random walk in Brownian 
motion. 

Still, the Skorokhod embedding techniques can be of some use in this field. 
One possible example involves extending the results known for Brownian mo- 
tion (as the invariance principles with (Ar„) uniform empirical process and (Yn) 
Brownian bridges) to the setup of arbitrary continuous time martingales (see 
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for example Haeusler and Mason 0H1)- Also, Skorokhod embedding might be 
used along with the Komlos-Major-Tusnady construction (see Zhang and Hsu 

m- 

11.3. A list of other applications 

Finally, we just point out several more fields, where Skorokhod embedding is 
used for various purposes. The reader is referred to the original articles for all 
details. 

• Financial mathematics: some examples of applications include moving be- 
tween discrete time and continuous time models (Guo 14 7|) and classical 
Black-Scholes model and other schemes (Walsh and Walsh 122 ). 

• Processes in random environment: random environment version of the 
Skorokhod embedding serves to embed a random walk in a random en- 
vironment into a diffusion process in the random environment (see Hu 

• Study of ^-stable distribution: see Donati-Martin, Ghomrasni and Yor |30|. 

• Invariance principle for the intersection local time - Cadre 14 uses Azema- 
Yor solution (see SectionlSJ to embed a two-dimensional random walk into 
a two-dimensional Brownian motion. He employs this embedding to prove 
that the renormalized intersection local time for planar Brownian motion 
can be obtained as a limit of the corresponding quantity in the discrete- 
time setting. 

• Functional CLT: Courbot [201 uses Skorokhod embedding to study the 
Prohorov distance between the distributions of Brownian motion and a 
square integrable cadlag martingale. 
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